Python: How to extract xml embedded in a html file?

Posted by georgehu on Stack Overflow See other posts from Stack Overflow or by georgehu
Published on 2010-04-26T22:26:49Z Indexed on 2010/04/26 22:43 UTC
Read the original article Hit count: 203

Filed under:
|
|

I have a html file with xml snipped embedded, the source code is pasted in the pastbin:

http://pastebin.com/Hy0QaWk8

my task is to extract the text enclosed in the first textarea, which is a xml snippet, from the html. Without any change to the original snippet. I'm able to get it by using the BeautifulSoup, but it changes all the tag names into lower case.

© Stack Overflow or respective owner

Related posts about python

Related posts about Xml