How to normalize a Word document?
Posted by AngryHacker on Super User
Published on 2010-04-20T19:58:24Z
I was too cheap to hire someone to retype a really, really long scanned document full of legalese, so I OCRed it using OmniPage. The OCR output was disappointing: I got a Word document with many different line spacings, and the spacing before and after paragraphs varies all over the place.
This would be easy if the entire document had the same paragraph settings, but it does not. There are probably half a dozen different styles in use.
What is the easiest way to normalize the document? For instance, if one paragraph has a line spacing of 20.4 pt and another has 20.9 pt, I'd like to treat them as the same style and set both to a single value. Really, any suggestion is welcome at this point.
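To make the idea concrete, here is a rough sketch of the "treat nearly equal spacings as one value" step in Python. The names `snap_spacings` and `normalize_docx` and the 1 pt tolerance are made up for illustration. The second function assumes the document has been saved as .docx and that the third-party python-docx package is installed; it only touches spacing applied directly to paragraphs, not spacing inherited from styles, so it may not catch everything OmniPage produced.

```python
from collections import Counter


def snap_spacings(values, tolerance=1.0):
    """Map each distinct spacing (in points) to a representative value.

    Distinct values are sorted and grouped greedily: a value joins the
    current group while it is within `tolerance` points of the group's
    smallest member. Each group is represented by its most frequent value.
    """
    counts = Counter(values)
    mapping = {}
    group = []
    for value in sorted(counts):
        if group and value - group[0] > tolerance:
            _assign(group, counts, mapping)
            group = []
        group.append(value)
    if group:
        _assign(group, counts, mapping)
    return mapping


def _assign(group, counts, mapping):
    # Representative = most frequent value; ties go to the smaller value.
    rep = max(group, key=lambda v: (counts[v], -v))
    for value in group:
        mapping[value] = rep


def normalize_docx(src_path, dst_path, tolerance=1.0):
    """Hypothetical helper: snap paragraph spacings in a .docx file.

    Assumption: python-docx is installed. Imported here so that
    snap_spacings() stays usable without it.
    """
    from docx import Document
    from docx.shared import Length, Pt

    doc = Document(src_path)
    for attr in ("line_spacing", "space_before", "space_after"):
        exact = []
        for para in doc.paragraphs:
            fmt = para.paragraph_format
            value = getattr(fmt, attr)
            # Skip inherited (None) values and "multiple" line spacing (float).
            if isinstance(value, Length):
                exact.append((fmt, round(value.pt, 1)))
        mapping = snap_spacings([pts for _, pts in exact], tolerance)
        for fmt, pts in exact:
            setattr(fmt, attr, Pt(mapping[pts]))
    doc.save(dst_path)
```

With the numbers from the question, `snap_spacings([20.4, 20.9, 20.4, 12.0])` maps 20.9 to 20.4 and leaves 12.0 alone. Running something like `normalize_docx("scan.docx", "scan-normalized.docx")` (file names are placeholders) would then write a copy with the snapped values, leaving the original file untouched.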