postscript to text
Posted
by SANJAY RAO
on Stack Overflow
See other posts from Stack Overflow
or by SANJAY RAO
Published on 2010-05-26T15:35:44Z
Indexed on
2010/05/26
19:51 UTC
Read the original article
Hit count: 241
I am working on a project to convert OCR'd PDf to png using ImageMagick and ghostscript and display in the browser so that i can select words in the image by letting a user query for the word . Imagemagick works fine along with ghostscript .
I have a problem with the ps2text utility where it does not work reliably with pdf's . could anybody suggest a good utility to convert postscript to text in Linux so that i can store it in a db . thereafter i use a custom written search class to find out the co-ordinates of each word and highlight the text in the browser .
Thanks
© Stack Overflow or respective owner