© 1999-2024 RingCentral, Inc. All rights reserved.
Privacy Notice
Is there a way to OCR incoming PDFs that are faxed in order to make them searchable
Page 1 / 1
You can do this buy retrieving the PDF and using an OCR API or the Tesseract Open Source package.
One API that can be used is the Google Vision API:
https://cloud.google.com/vision/docs/pdf
The Tesseract Open Source OCR engine is generally considered one of, if not, the best open source solutions:
https://github.com/tesseract-ocr/tesseract
One API that can be used is the Google Vision API:
https://cloud.google.com/vision/docs/pdf
The Tesseract Open Source OCR engine is generally considered one of, if not, the best open source solutions:
https://github.com/tesseract-ocr/tesseract
Just want to mention that OCR is not the only way to extract text from PDF. If the PDF's content is text instead of image, you can use some library to extract the text. Search GitHub for "pdf to text".
Reply
Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.