Complicated parsing in python
Posted
by
Quazi Farhan
on Stack Overflow
See other posts from Stack Overflow
or by Quazi Farhan
Published on 2012-04-09T23:00:49Z
Indexed on
2012/04/09
23:29 UTC
Read the original article
Hit count: 274
I have a weird parsing problem with python. I need to parse the following text.
Here I need only the section between(not including) "pre" tag and column of numbers (starting with 205 4 164). I have several pages in this format.
<html>
<pre>
A Short Study of Notation Efficiency
CACM August, 1960
Smith Jr., H. J.
CA600802 JB March 20, 1978 9:02 PM
205 4 164
210 4 164
214 4 164
642 4 164
1 5 164
</pre>
</html>
© Stack Overflow or respective owner