Information Extraction Toolkits
Posted
by MathGladiator
on Stack Overflow
See other posts from Stack Overflow
or by MathGladiator
Published on 2010-04-25T02:19:13Z
Indexed on
2010/04/25
2:23 UTC
Read the original article
Hit count: 296
I'm looking for information extraction libraries where I can have semi structured information that may have either hidden or incomplete data. I want to train some classifiers to pull out content based on the structure.
I'm working on building a tool where I can select text in the browser, and it will generate (via some web service call) a classifier that can be used on other documents to pull out text.
I'm primarily looking at how the structure of the document can be used to indicate what the content is.
© Stack Overflow or respective owner