How to parse XML with special characters?
Posted
by Snooze
on Stack Overflow
See other posts from Stack Overflow
or by Snooze
Published on 2010-06-05T07:37:48Z
Indexed on
2010/06/05
7:42 UTC
Read the original article
Hit count: 289
Whenever I try to parse XML with special characters such as o or ???? I get an error. The xml documents claims to use UTF-8 encoding but that does not seem to be the case. Here is what the troublesome text looks like when I view the XML in Firefox:
Bleach: The Diamond Dust Rebellion - MÅ? Hitotsu no HyÅ?rinmaru; Bleach - The DiamondDust Rebellion - Mou Hitotsu no Hyourinmaru
On the actual website, Å? is actually the character o.
<br /> One day, Doraemon and his friends meet Professor Mangetsu (æº?æ??å??ç??, Professor Mangetsu?), who studies magic and magical beings such as goblins, and his daughter Miyoko (ç¾?å¤?å?, Miyoko?), and are warned of the dangerous approximation of the "star of the Underworld" to the Earth's orbit.<br /> <br />
And once again, on the actual website, those characters appear as ???? and ???.
The actual XML file is formatted properly other than those special characters, which certainly do not appear to be using the UTF-8 encoding. Is there a way to get NSXML to parse these XML files?
© Stack Overflow or respective owner