Python: How to extract xml embedded in a html file?
Posted
by georgehu
on Stack Overflow
See other posts from Stack Overflow
or by georgehu
Published on 2010-04-26T22:26:49Z
Indexed on
2010/04/26
22:43 UTC
Read the original article
Hit count: 203
I have a html file with xml snipped embedded, the source code is pasted in the pastbin:
my task is to extract the text enclosed in the first textarea, which is a xml snippet, from the html. Without any change to the original snippet. I'm able to get it by using the BeautifulSoup, but it changes all the tag names into lower case.
© Stack Overflow or respective owner