Looking for recommendations on OCR problem - tabular numeric data

Posted by ldigas on Super User See other posts from Super User or by ldigas
Published on 2010-04-01T22:23:49Z Indexed on 2010/04/01 22:33 UTC
Read the original article Hit count: 387

Filed under:

I have 20 pages of experiment measurement data which I need to digitalize. The results are in tabular form, scanned in 600 dpi resolution, and as far as scans go, they came up pretty clean and readable.

For an example of how it looks see here (but beware: it is a rather big scan; about 5Mb; no problem for any broadband connection, but dialups should approach with caution!)

... and I need it finished by sunday afternoon (:-o) <-- smiley in a state of panic

(then why did't you start sooner?)... yea, yeah ... I know ... but, it came up late, and I wasn't thinking I was gonna need this data also.

So, I'm looking for recommendations. I haven't much experience with OCR programs, save scanning a page or two of pure text, but just to mention, I haven't the wish also to test out every OCR program out there. So this isn't a "name your OCR favourite".

What I'm looking is advice from someone who's done something like that, and his/hers experience on what would be the best way to undertake.

I need the data in txt form but since it will have to be checked (by drawing it, and just simply watching whether some points "jump out") I'll probably be entering it in Excel at first.

© Super User or respective owner

Related posts about ocr