OCRTesseractDoOCR() returning blank result with custom creat

Discussions about Tesseract OCR integration in GdPicture.
IXDev2
Posts: 2
Joined: Mon Nov 03, 2014 3:37 pm


Post by IXDev2 » Tue Nov 04, 2014 4:17 pm

I created a .traineddata file by following the steps given at TrainingTesseract3 (https://code.google.com/p/tesseract-ocr ... Tesseract3).
The file, custom.traineddata, is in the attachment under the "GDPictureOCRIssueWithCustomTraineddataFile\GDPictureOCRIssueWithCustomTraineddataFileTestData\TestData\CustomDictionary" folder. When I use this custom.traineddata file with OCRTesseractDoOCR(), I get a blank result and the error "actual_tessdata_num_entries_ <= TESSDATA_NUM_ENTRIES:Error:Assert failed:in file ..\ccutil\tessdatamanager.cpp, line 48".

The attachment also contains the data files I used to create custom.traineddata, under the "GDPictureOCRIssueWithCustomTraineddataFile\GDPictureOCRIssueWithCustomTraineddataFileTestData\TestData\DataUsedForCreatingCustomDictionary\" folder, and a sample application that reproduces the problem. We are using GdPicture.NET 9.

I also found that the same custom.traineddata file gives the correct result for the same input when used with Google's tesseract-ocr (tesseract.exe).

What could be the problem with OCRTesseractDoOCR()? Am I missing a data file needed to create the .traineddata file?
Attachments
GDPictureOCRIssueWithCustomTraineddataFile.7z
(1.54 MiB) Downloaded 80 times

David
Posts: 66
Joined: Mon Feb 08, 2016 3:12 pm

Re: OCRTesseractDoOCR() returning blank result with custom c

Post by David » Mon Feb 08, 2016 5:20 pm

Hi,

The error could be caused by a trained data file format that differs from the one the engine expects, for example because the file was built with a different Tesseract version than the one GdPicture uses. As suggested in the other post, could you please give us more details about the OCR challenges you want to solve?
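One way to check the format hypothesis is to look at the header of the .traineddata file itself. The sketch below is not part of GdPicture or Tesseract. It assumes the Tesseract 3.x file layout as I understand it from tessdatamanager.cpp: a little-endian 32-bit entry count followed by that many 64-bit offsets, where -1 marks a component that is not present. The function names are made up for illustration. The assertion in the error message fires when the entry count stored in the file is larger than the number of component slots the engine was compiled with, so a file built by newer training tools would fail in an older engine.

```python
import struct
import sys


def read_traineddata_header(data: bytes):
    """Parse the header of a Tesseract 3.x .traineddata blob.

    Assumed layout (not confirmed for every version): int32 entry count,
    then `count` int64 offsets, all little-endian, no padding.
    Returns (count, offsets).
    """
    if len(data) < 4:
        raise ValueError("data too short to contain an entry count")
    (count,) = struct.unpack_from("<i", data, 0)
    if count < 0 or len(data) < 4 + 8 * count:
        raise ValueError("implausible entry count %d for this layout" % count)
    offsets = list(struct.unpack_from("<%dq" % count, data, 4))
    return count, offsets


def present_components(offsets):
    """Count the component slots that actually hold data (offset >= 0)."""
    return sum(1 for off in offsets if off >= 0)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_traineddata.py custom.traineddata
    with open(sys.argv[1], "rb") as f:
        count, offsets = read_traineddata_header(f.read(4096))
    print("entry count stored in file:", count)
    print("components present:", present_components(offsets))
```

Running this on custom.traineddata and on a language file that is known to work with GdPicture (both paths are yours to supply) and comparing the two entry counts would show whether the custom file declares more entries than the working one.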

May I ask you to share the input data for the training process and your training procedure? This way I would be able to try and reproduce your issue using the latest versions of GdPicture and Tesseract tools.

Regards,

David
