Reverse Engineer a web page

Posted by Phil on Stack Overflow
Published on 2010-05-30T11:17:30Z, indexed on 2010-05-30 11:22 UTC

Hi, I want to reverse engineer any web page into a logical representation of that page. For example, if a web page has a menu, I want a logical menu structure, perhaps in XML. If the page has an article, I want an article XML node; if the article has a title, I want a title XML node. Basically, I want the logical form of the web page without any of the user interface.

This logical model could be either objects in code or XML; it doesn't matter. The important part is that it identifies what everything on the page means.
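
To make the idea concrete, here is a rough sketch of the kind of output I mean, using only the Python standard library. It assumes the page already uses semantic HTML5 tags (`nav`, `article`, `h1`), which many real pages do not, so treat it as an illustration of the target structure rather than a general solution. The names `LOGICAL_TAGS`, `LogicalExtractor` and `to_logical_xml` are made up for this example.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# Hypothetical mapping from semantic HTML tags to logical node names.
# Assumption: the page marks up its menu with <nav> and its article with <article>.
LOGICAL_TAGS = {"nav": "menu", "article": "article", "h1": "title", "a": "item"}


class LogicalExtractor(HTMLParser):
    """Builds a small logical XML tree and ignores presentational markup."""

    def __init__(self):
        super().__init__()
        self.root = ET.Element("page")
        self.stack = [self.root]  # logical nodes that are currently open
        self.open_tags = []       # HTML tag names that opened stack[1:]

    def handle_starttag(self, tag, attrs):
        name = LOGICAL_TAGS.get(tag)
        if name is None:
            return  # e.g. <div>, <ul>, <li>, <p>: layout only, skipped
        if name == "item" and self.stack[-1].tag != "menu":
            return  # a link only counts as a menu item directly inside a menu
        node = ET.SubElement(self.stack[-1], name)
        self.stack.append(node)
        self.open_tags.append(tag)

    def handle_endtag(self, tag):
        # Close a logical node only when its own HTML tag closes.
        if self.open_tags and self.open_tags[-1] == tag:
            self.open_tags.pop()
            self.stack.pop()

    def handle_data(self, data):
        text = data.strip()
        node = self.stack[-1]
        # Keep text only for leaf-like logical nodes.
        if text and node.tag in ("title", "item"):
            node.text = f"{node.text} {text}" if node.text else text


def to_logical_xml(html: str) -> str:
    """Return the logical structure of an HTML string as an XML string."""
    parser = LogicalExtractor()
    parser.feed(html)
    parser.close()
    return ET.tostring(parser.root, encoding="unicode")
```

For example, feeding it `<nav><ul><li><a href="/">Home</a></li></ul></nav><article><h1>Hello</h1><p>Body</p></article>` gives `<page><menu><item>Home</item></menu><article><title>Hello</title></article></page>`. The hard part, and what I am really asking about, is doing this for pages that have no such semantic markup.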

