License key error while loading PDF

Discussions about Tesseract OCR integration in GdPicture.
Post Reply
Cirunz
Posts: 2
Joined: Tue Nov 28, 2017 1:36 pm

License key error while loading PDF

Post by Cirunz » Tue Nov 28, 2017 1:54 pm

Hi, I buyed the GdPicture.NET TWAIN PRO SDK V14 and the Tesseract OCR plugin.
What I'm trying to do is to extract text from a PDF file in a batch operation, so I don't need to show the PDF file to the user. In fact, I can work entirely without a visual interface for this operation: I only need to show the extracted text at the end.

So, I'm using this code, taken from the sdk sample:

Code: Select all

public static bool extracttextfromfile(string filepath, bool moreaccuracy, 
    bool usearea, int AreaLeft, int AreaTop, int AreaWidth, int AreaHeight,
    string whitelist, string blacklist,
    out string resulttext, out string error)
{
    bool result = false;

    resulttext = string.Empty;
    error = string.Empty;
    try
    {
        GdPicture14.LicenseManager lm = new GdPicture14.LicenseManager();
        lm.RegisterKEY(twainlicensekey);//private variable containing the key
        lm.RegisterKEY(pluginlicensekey);//private variable containing the key
        using (GdPictureImaging gdimg = new GdPictureImaging())
        {
            int imagehandle = 0;

            if (GdPictureDocumentUtilities.GetDocumentFormat(filepath) == GdPicture14.DocumentFormat.DocumentFormatPDF)
            {                
                using (GdPicturePDF gdPicturePDF = new GdPicturePDF())
                {
                    if (gdPicturePDF.LoadFromFile(filepath, false) == GdPictureStatus.OK)
                    {
                        imagehandle = gdPicturePDF.RenderPageToGdPictureImageEx(200, true);
                        gdPicturePDF.CloseDocument();
                    }
                }
            
            }
            else
            {
                imagehandle = gdimg.CreateGdPictureImageFromFile(filepath);
            }

            if (imagehandle != 0)
            {
                RotateFlipType pageRotate = (RotateFlipType)(gdimg.TagGetExifRotation(imagehandle));
                if (pageRotate != (RotateFlipType)GdPictureRotateFlipType.GdPictureRotateNoneFlipNone)
                {
                    gdimg.Rotate(imagehandle, pageRotate);
                    gdimg.TagDeleteAll(imagehandle);
                }
            }

            gdimg.OCRTesseractSetOCRContext(OCRContext.OCRContextSingleBlock);//I'm using this one for now, more tests later on the context

            if (usearea)
            {
                gdimg.SetROI(AreaLeft, AreaTop, AreaWidth, AreaHeight);
            }
            else
            {
                gdimg.ResetROI();
            }

            gdimg.OCRTesseractReinit();
            ///ref: http://guides.gdpicture.com/content/webframe.html#Affect%20Tesseract%20OCR%20engine%20with%20special%20parameters.html
            gdimg.OCRTesseractSetVariable("tessedit_char_blacklist", blacklist);

            if (moreaccuracy)
            {
                gdimg.OCRTesseractSetPassCount(0);//0 means all possible passes.
            }
            else
            {
                gdimg.OCRTesseractSetPassCount(1);
            }

            resulttext = gdimg.OCRTesseractDoOCR(imagehandle, "ita", @"D:\GdPicture.NET 14\Redist\OCR\", whitelist);
            if (gdimg.GetStat() == GdPictureStatus.OCRDictionaryNotFound)
            {
                error = "Dizionario non trovato nel percorso specificato!";
            }
            else
            {
                result = !string.IsNullOrEmpty(resulttext);
            }
        }
    }
    catch (Exception err)
    {
        result = false;
        resulttext = string.Empty;
        error = err.Message;
    }

    return result;
}
It works fine, and it extract the text correctly, but it give me a license error on this line:

if (gdPicturePDF.LoadFromFile(filepath, false) == GdPictureStatus.OK)

This is the error:
gdPicture License error.png
gdPicture License error.png (5.29 KiB) Viewed 54 times


What am I doing wrong? I guess the LoadFromFile is somewhat related to pdf viewing library, that I do not have a license for, but the sales support said I can go with the TWAIN and plugin only if I didn't need to show the PDF file on my window.
Is it something else? or there is a way to get the pdf image render without using that function, when I work in background?

Thanks

Coralie
Posts: 1
Joined: Tue Nov 28, 2017 5:32 pm

Re: License key error while loading PDF

Post by Coralie » Tue Nov 28, 2017 5:37 pm

Hi Cirunz,
You are using in your code GdPicturePDF class (GdPicturePDF gdPicturePDF = new GdPicturePDF()) which means you would need to add Managed PDF Plugin to your configuration: http://www.gdpicture.com/products/managed-pdf/.
You can purchase the plugin directly from our site: http://www.gdpicture.com/order/buy-gdpicture/ just tick the box and specify the number of developers.
Otherwise, you can contact our sales team at sales(@) orpalis.com for more details.

Thank you

Cirunz
Posts: 2
Joined: Tue Nov 28, 2017 1:36 pm

Re: License key error while loading PDF

Post by Cirunz » Tue Nov 28, 2017 5:41 pm

Thank you for the information.

So just to be clear: there is no way to do OCR on a PDF without this plugin, even if I don't need to show the PDF on a form?

Post Reply

Who is online

Users browsing this forum: No registered users and 1 guest