Best way to fingerprint and verify HTML structure

Posted by Lukas Šalkauskas on Stack Overflow
Published on 2009-09-29T05:22:48Z Indexed on 2010/04/16 13:53 UTC


Hello there,

I would like to hear your opinion on how to fingerprint and verify the structure of HTML pages and their links.

The problem I want to solve: fingerprint, for example, the HTML pages of 10 different sites. After some time I want to be able to verify them: if a site has changed (for example, its links have changed), verification fails; otherwise verification succeeds. My basic idea is to analyze the link structure by splitting it up in some way, building a tree from it, and generating some kind of code from that tree. But I'm still at the brainstorming stage, so I would like to discuss this with someone and hear other ideas.
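
To make the tree idea concrete, here is a minimal sketch in Python, not a finished design. It assumes that "link structure" means the set of `<a href>` targets on a page, that the tree is keyed by host and then by path segment, and that the "code" is a SHA-256 hash of the tree serialized with sorted keys. The names `LinkCollector`, `link_tree`, `fingerprint` and `verify` are made up for illustration. Query strings, fragments, duplicate links and link order are ignored, which may or may not be what is wanted.

```python
import hashlib
import json
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def link_tree(html, base_url):
    """Build a nested dict (host -> path segment -> ...) from the page's links."""
    collector = LinkCollector()
    collector.feed(html)
    tree = {}
    for href in collector.hrefs:
        # Resolve relative links against the page URL before splitting.
        parts = urlsplit(urljoin(base_url, href))
        if parts.scheme not in ("http", "https"):
            continue  # assumption: mailto:, javascript:, etc. are not structure
        node = tree.setdefault(parts.netloc.lower(), {})
        for segment in filter(None, parts.path.split("/")):
            node = node.setdefault(segment, {})
    return tree


def fingerprint(html, base_url):
    """Hash a canonical (sorted-key) serialization of the link tree."""
    canonical = json.dumps(link_tree(html, base_url), sort_keys=True)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def verify(html, base_url, stored_fingerprint):
    """True if the page's current link tree matches the stored fingerprint."""
    return fingerprint(html, base_url) == stored_fingerprint
```

With this sketch, the fingerprint for each of the 10 pages would be stored once, and `verify` would be called later on the freshly downloaded HTML. Because the hash covers only the link tree, changes to text or styling pass verification, while any added, removed or renamed link target fails it. An exact hash cannot say how much changed; if small changes should be tolerated, the trees themselves would need to be stored and compared.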

So any ideas, algorithms, and suggestions would be useful.

© Stack Overflow or respective owner
