Is there anything for Python that is like readability.js?
Posted
by Emre Sevinç
on Stack Overflow
See other posts from Stack Overflow
or by Emre Sevinç
Published on 2010-05-27T12:53:27Z
Indexed on
2010/05/28
14:02 UTC
Read the original article
Hit count: 367
Hi,
I'm looking for a package / module / function etc. that is approximately the Python equivalent of Arc90's readability.js
http://lab.arc90.com/experiments/readability
http://lab.arc90.com/experiments/readability/js/readability.js
so that I can give it some input.html and the result is cleaned up version of that html page's "main text". I want this so that I can use it on the server-side (unlike the JS version that runs only on browser side).
Any ideas?
PS: I have tried Rhino + env.js and that combination works but the performance is unacceptable it takes minutes to clean up most of the html content :( (still couldn't find why there is such a big performance difference).
© Stack Overflow or respective owner