Can not recognize pdf scanned page with greek words by using PB , EZTWAIN and TOCR 3.0

Posted by sgian76 on Stack Overflow See other posts from Stack Overflow or by sgian76
Published on 2010-04-08T09:14:20Z Indexed on 2010/04/22 20:33 UTC
Read the original article Hit count: 203

Filed under:
|

Hi, Iam using PB 10.5.2 and EZTwain 3.30.0.28, XDefs 1.36b1 by Dosadi for scanning.

Also Iam using the TOCR 3.0 for OCR management.

In a function we use the following among all others :

...

Long ll_acquire

(as_path_filename is a function argument)

...

...

TWAIN_SetAutoOCR(1)

ll_acquire = TWAIN_AcquireMultipageFile(0, as_path_filename)

the problem is that the scanned pdf page has latin (english) and greek words. The English characters are searched quite precisely but the greek don't at all.

Do you think this that this has to do with the TOCR software. I just want to search AND for greek words

Thanks in advance

© Stack Overflow or respective owner

Related posts about ocr

Related posts about powerbuilder