Question

Is there a way to OCR incoming PDFs that are faxed in order to make them searchable

5 years ago
2 August 2018
2 replies
727 views

aaron11128
New Participant
0 replies

2 replies

Anonymous
0 replies
5 years ago
2 August 2018

You can do this buy retrieving the PDF and using an OCR API or the Tesseract Open Source package.

One API that can be used is the Google Vision API:

https://cloud.google.com/vision/docs/pdf

The Tesseract Open Source OCR engine is generally considered one of, if not, the best open source solutions:

https://github.com/tesseract-ocr/tesseract

Tyler850957020
Community Manager
548 replies
5 years ago
3 August 2018

Just want to mention that OCR is not the only way to extract text from PDF. If the PDF's content is text instead of image, you can use some library to extract the text. Search GitHub for "pdf to text".

Reply

PRODUCTS
RingEX
Message
Video
Phone

OPEN ECOSYSTEM
Developer Platform
APIs
Integrated Apps
App Gallery
Developer support
Games and rewards

RESOURCES
Resource center
Blog
Product Releases
Accessibility

QUICK LINKS
App Download
RingCentral App login
Admin Portal Login
Contact Sales

Reply

Sign up

Login with SSO

Login to the community

Login with SSO

Scanning file for viruses.

This file cannot be downloaded