Why do I get an extra newline in the middle of a UTF-8 character with XML::Parser?
- by René Nyffenegger
I encountered a problem when dealing with UTF-8, XML and Perl. The following is the smallest
piece of code and data needed to reproduce the problem.
Here's an XML file that needs to be parsed:
<?xml version="1.0" encoding="utf-8"?>
<test>
<words>???????????? ??????? ????????? ???? ???????????? ??????</words>
…