Is their an optimal config/format for a TIFF when using Tesseract or other OCR?

Posted by Zando on Stack Overflow See other posts from Stack Overflow or by Zando
Published on 2010-04-19T22:29:02Z Indexed on 2010/04/19 22:33 UTC
Read the original article Hit count: 372

Filed under:

image-processing

|

image-manipulation

|

tiff

|

ocr

|

tesseract

I'm having a bizarre problem with Tesseract. I have a name, "Janice" that is in a 200x40 pixel tiff, that Tesseract interprets as a blank. I'm running hundreds of names through Tesseract and they are processed fine.

What I'm actually doing, though, is breaking up a larger TIFF into smaller tiffs of one word each. In the larger TIFF, tesseract recognizes "Janice".

What could cause it to hiccup in a TIFF that solely contains that word (and there's enough space around the word to not truncate any of the pixels)? I'm using ImageMagick to split the big TIFF, are there options I should set when reconstituting the new TIFF files?

© Stack Overflow or respective owner

Related posts about image-processing

Basic image processing

as seen on Stack Overflow - Search for 'Stack Overflow'
i have two image of a parking space.one is empty.and another is with some car.now how can i detect the empty space in the image?or how can i detect the car in the image? >>> More
basic image processing

as seen on Stack Overflow - Search for 'Stack Overflow'
what is noise in an image? >>> More
image processing

as seen on Stack Overflow - Search for 'Stack Overflow'
where i can find useful tutorials about background subtraction?what are the algorithms? >>> More
open source image processing lib in java

as seen on Stack Overflow - Search for 'Stack Overflow'
can any one suggest a good open soucre image processing lib in java? i want to devleop a devlop a ORM reader using it. >>> More
Photoshop batch image processing based on EXIF?

as seen on Super User - Search for 'Super User'
I use Photoshop to batch convert my RAW files to JPG. I was wondering if there was a way to make it take different actions based on EXIF? With my particular camera lens, if I open it all the way to F1.7 there is noticeable vignetting, but stopping down smaller doesn't have that problem. What I'd like… >>> More

Related posts about image-manipulation

Image Manipulation in Multitouch Development

as seen on Code Project - Search for 'Code Project'
In this article I will describe about the Image manipulation in Windows 7 multitouch Environment >>> More
[jQuery] [PHP] Image manipulation

as seen on Stack Overflow - Search for 'Stack Overflow'
hello, I want to do some kind of image editor, after I upload more images i want to make a list with all the thumbnails! after i want to be able to click on one thumb and rotate, duplicate, drag and drop (to change positions of the images), delete the image! all the images i want to be in a php… >>> More
Image manipulation

as seen on Stack Overflow - Search for 'Stack Overflow'
Hi, I am just wondering what kind of computing/programming language/frameworks are needed to produce images such as the one in http://www.erdas.com/ ? Programmatically, how does one produce the general spatial analysis images ? ps: I use java most of the time. Thanks >>> More
very large image manipulation and tiling

as seen on Stack Overflow - Search for 'Stack Overflow'
I need to a software , Program(Java),or a method for tiling very larg images (more than 140MB). I have used imagemagic and convert tools photoshop and corel draw and matlab (in win os) but I have problem with memory amount.and memory is not enough.imagemagic is very slow and result is not desirable… >>> More
Codeigniter image manipulation class rotates image during resize

as seen on Stack Overflow - Search for 'Stack Overflow'
I'm using Codeigniter's image manipulation library to re-size an uploaded image to three sizes, small, normal and large. The re-sizing is working great. However, if I'm resizing a vertical image, the library is rotating the image so it's horizontal. These are the config settings I have in place: … >>> More