Extract hindi characters from pdf

Posted by Eknath Iyer on Stack Overflow See other posts from Stack Overflow or by Eknath Iyer
Published on 2010-05-20T03:42:07Z Indexed on 2010/05/20 3:50 UTC
Read the original article Hit count: 248

Filed under:

I have a pdf file with English and Hindi Text in it and I need to extract text into raw text(utf-8).

I tried using openoffice but the hindi characters get ruined

© Stack Overflow or respective owner

Related posts about pdf