Asp.net library to extract plain text from docx, pptx, xlsx (for search index)
- by Myster
Is there a pre-existing library to extract plain text form docx, pptx, and xlsx files?
I require this to populate a lucene.net index.
I've found this example which extracts text from docx and it seems to work ok.
But before building my own solution based on this I was wondering if there's something already available for the other file formats?