Is there anything for Python that is like readability.js?

Posted by Emre Sevinç on Stack Overflow
Published on 2010-05-27T12:53:27Z Indexed on 2010/05/28 14:02 UTC

Hi,

I'm looking for a package, module, or function that is roughly the Python equivalent of Arc90's readability.js:

http://lab.arc90.com/experiments/readability

http://lab.arc90.com/experiments/readability/js/readability.js

The idea is that I give it some input.html and get back a cleaned-up version of that HTML page's "main text". I want this so that I can use it on the server side (unlike the JS version, which runs only in the browser).
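To make the request concrete, here is a rough sketch of the kind of interface I mean. It is not a port of readability.js: it uses only the standard library's html.parser and a much cruder heuristic (keep the paragraphs of whichever container element holds the most paragraph text). The names extract_main_text and MainTextParser, and the choice of container tags, are made up for illustration.

```python
from html.parser import HTMLParser

# Assumption: these tags are treated as candidate "content containers".
CONTAINERS = {"body", "div", "article", "section", "td"}
# Text inside these tags is never counted as page content.
SKIPPED = {"script", "style"}


class MainTextParser(HTMLParser):
    """Groups the text of <p> elements by their nearest enclosing container."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.stack = []       # (tag, id) for each currently open container
        self.paragraphs = {}  # container id -> list of paragraph strings
        self.next_id = 0
        self.in_p = False
        self.skip_depth = 0
        self.buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED:
            self.skip_depth += 1
        elif tag in CONTAINERS:
            self._flush()  # a new container implicitly ends an open <p>
            self.stack.append((tag, self.next_id))
            self.next_id += 1
        elif tag == "p":
            self._flush()  # a new <p> implicitly ends the previous one
            self.in_p = True

    def handle_endtag(self, tag):
        if tag in SKIPPED:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif tag == "p":
            self._flush()
        elif tag in CONTAINERS:
            self._flush()
            # Pop back to the nearest matching open tag; tolerate sloppy markup.
            for i in range(len(self.stack) - 1, -1, -1):
                if self.stack[i][0] == tag:
                    del self.stack[i:]
                    break

    def handle_data(self, data):
        if self.in_p and not self.skip_depth:
            self.buffer.append(data)

    def _flush(self):
        """Store the buffered paragraph text under the innermost container."""
        text = " ".join("".join(self.buffer).split())  # collapse whitespace
        self.buffer = []
        self.in_p = False
        if text:
            key = self.stack[-1][1] if self.stack else -1
            self.paragraphs.setdefault(key, []).append(text)


def extract_main_text(html):
    """Return the paragraphs of the container with the most paragraph text."""
    parser = MainTextParser()
    parser.feed(html)
    parser.close()
    parser._flush()
    if not parser.paragraphs:
        return ""
    best = max(parser.paragraphs.values(),
               key=lambda ps: sum(len(p) for p in ps))
    return "\n\n".join(best)
```

With something like this, the server-side call would be roughly extract_main_text(open("input.html", encoding="utf-8").read()). A real equivalent of readability.js would also need to score elements by class/id names and link density, and to return cleaned HTML rather than plain text, which this sketch does not attempt.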

Any ideas?

PS: I have tried Rhino + env.js, and that combination works, but the performance is unacceptable: it takes minutes to clean up most HTML content :( (I still haven't figured out why there is such a big performance difference).

