Extracting text from PDF with Poppler (C++)
- by nico
I'm trying to get my way through Poppler and its (lack of) documentation.
What I want to do is a very simple thing: open a PDF file and read the text in it. I'm then going to process the text, but that doesn't really matter here.
So... I saw the poppler_page_get_text function, and it kind of works, but I have to specify a selection rectangle, which is not very handy. Isn't there just a very simple function that would output the PDF text in order (maybe line by line?).
Thank you
Nicola