Python: How to extract xml embedded in a html file?
- by georgehu
I have a html file with xml snipped embedded, the source code is pasted in the pastbin:
http://pastebin.com/Hy0QaWk8
my task is to extract the text enclosed in the first textarea, which is a xml snippet, from the html. Without any change to the original snippet. I'm able to get it by using the BeautifulSoup, but it changes all the tag names into lower case.