Hi all
I've simply used the following program on the url below
http://jericho.htmlparser.net/samples/console/src/ExtractText.java
My goal is to be able to extract the main body text, to be able to summarize it and present the summarized text as output to the user.
My problem is that, I'm not sure how I'd modify the above program to only get the required text from the webpage, without the links or any other information.
Again, I'd really appreciate any help I could get.
Thanks in advance