Loading...

OCR new Language

Support for GdPicture Tessaract Plugin.

OCR new Language

Postby RobertH » Mon Feb 07, 2011 3:54 pm

I need to use the OCR component to do an OCR on newspapers. As I am in Sough Africa, the OCR also needs to run on the Afrikaans newspapers.

Now I assume that there is no language plug-in for Afrikaans. So what will happen if I try and run the OCR on an Afrikaans page with English specified as a language? Will it do an OCR on the page, ignoring the words that it does not find a match for? Is there a way for me to create a dictionary file for Afrikaans?
RobertH
 
Posts: 3
Joined: Mon Sep 28, 2009 8:58 am

Re: OCR new Language

Postby Loïc » Thu Feb 10, 2011 5:53 pm

Hi Robert,

Unfortunately we do not provide support for creating new dictionary. Next version of the engine will come with a bunch of new languages, today I am not able to say if Afrikaans will be available or not.

Thank you for your comprehension.

kind regards,

Loïc
Loïc Carrère, support team.
www.orpalis.com
User avatar
Loïc
Site Admin
 
Posts: 4437
Joined: Tue Oct 17, 2006 10:48 pm
Location: France

Re: OCR new Language

Postby Mcb2000 » Tue Mar 08, 2011 5:30 pm

Hello Loïc

Do you have a new estimate on when that version will become available?

Regards,
Martin
Mcb2000
 
Posts: 3
Joined: Wed Aug 04, 2010 1:45 pm

Re: OCR new Language

Postby Loïc » Tue Mar 08, 2011 6:34 pm

Hi Martin,

We have to finish first our next 2 Plugins (64-bit PDF & Annotation), new Tesseract 3 engine gateway will follow. This will takes probably between 3 and 6 months.

Kind regards,

Loïc
Loïc Carrère, support team.
www.orpalis.com
User avatar
Loïc
Site Admin
 
Posts: 4437
Joined: Tue Oct 17, 2006 10:48 pm
Location: France


Return to GdPicture Tesseract OCR Engine Plugin

Who is online

Users browsing this forum: No registered users and 0 guests