Search Results

Search found 3 results on 1 pages for 'htmlparsing'.

Page 1/1 | 1 

  • need help working with the Jericho Html Parser

    - by rookie
    Hi all I've simply used the following program on the url below http://jericho.htmlparser.net/samples/console/src/ExtractText.java My goal is to be able to extract the main body text, to be able to summarize it and present the summarized text as output to the user. My problem is that, I'm not sure how I'd modify the above program to only get the required text from the webpage, without the links or any other information. Again, I'd really appreciate any help I could get. Thanks in advance

    Read the article

  • Which is the best HTML tidy pack? Is there any option in HTML agility pack to make HTML webpage tidy

    - by Harikrishna
    I am using html agility pack to parse html tabular information. Now there is some html content with missing ending tags and from such page because of missing ending tags html agility pack does not parse information properly.So I want to insert ending tags where there are missing ending tags so html agility pack parse information properly. So to insert the missing ending tags what should I do ?Should I do write my own code for that or use html tidy pack to do that ? If html tidy pack then which is the best html tidy pack,and how to use it any example if possible ? And if my own code than what it can be like ? Is there any option in html agility pack which can make us able to first make the html page tidy and then parse the webpage.

    Read the article

1