OCRPages and Result

Discussions about Tesseract OCR integration in GdPicture.
win568
Posts: 44
Joined: Tue Mar 24, 2015 1:00 pm

Post by win568 » Thu Nov 10, 2016 4:11 pm

Hi

I developed two variants of OCR scanning. The first converts the PDF to a searchable file with OCRPages() and later reads the result back out of the file. The second is the "normal" Tesseract OCR recognition, which runs when there is no text in the PDF during the recognition process.

The advantage of the first method is fast OCR on multi-page PDF files. Unfortunately, there is no function to do this with the Tesseract OCR methods.

The problem I have is that signed documents and documents with PDF/A conformance cannot be converted to searchable PDF (loss of integrity). For these, it would be a huge advantage to run multi-page OCR processing without writing the result into the PDF (e.g. execute with OCRPages and get the result in an event with GetCharLeft/Top/Right/Bottom/Line/Confidence).
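
To make the request concrete, here is a minimal sketch of the idea in Python. It is not GdPicture code: `CharBox`, `ocr_pages_in_memory`, `recognize_page` and `fake_recognizer` are hypothetical names, and the recognizer is only a stand-in for whatever engine call performs the per-page OCR. The point is just that pages are processed concurrently and the per-character results stay in memory, so the source PDF is never modified.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class CharBox:
    """One recognized character with its bounding box (hypothetical shape)."""
    char: str
    left: int
    top: int
    right: int
    bottom: int
    line: int
    confidence: float


# Hypothetical signature: takes a 1-based page number, returns its characters.
PageRecognizer = Callable[[int], List[CharBox]]


def ocr_pages_in_memory(
    page_count: int,
    recognize_page: PageRecognizer,
    max_workers: int = 4,
) -> Dict[int, List[CharBox]]:
    """Run OCR on every page concurrently and keep the results in memory only."""
    pages = range(1, page_count + 1)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() yields results in page order, even if pages finish out of order.
        results = list(pool.map(recognize_page, pages))
    return dict(zip(pages, results))


def fake_recognizer(page: int) -> List[CharBox]:
    """Stand-in for a real OCR engine call; returns one dummy character."""
    return [CharBox("A", 10, 20, 18, 32, line=1, confidence=90.0 + page)]


if __name__ == "__main__":
    result = ocr_pages_in_memory(3, fake_recognizer)
    for page, chars in result.items():
        print(page, [(c.char, c.confidence) for c in chars])
```

With something like this, a signed or PDF/A document would stay untouched, and the caller could still read the position, line and confidence of each character per page.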

win568

Re: OCRPages and Result

Post by win568 » Wed Nov 16, 2016 11:01 am

Hi

Do you have any advice for me? Or should I open a ticket?

Cedric
Posts: 260
Joined: Sun Sep 02, 2012 7:30 pm

Re: OCRPages and Result

Post by Cedric » Fri Nov 18, 2016 11:13 am

Yes, please open a ticket on the support platform, and make sure you provide all the required elements (source code, input file, etc.) so we can replicate the issue on our end.
Thank you.
