Convert ISO-8859-1 to UTF-8

Posted by tau on Stack Overflow
Published on 2010-04-17T00:06:18Z Indexed on 2010/04/17 0:13 UTC

I have several documents I need to convert from ISO-8859-1 to UTF-8 (without a BOM, of course). The problem is volume: there are so many of them, in a mix of encodings (some are already UTF-8, some are ISO-8859-1), that I need an automated way to convert them. Unfortunately, I only have ActivePerl installed and don't know much about handling encodings in Perl. I may be able to install PHP, but I am not sure, as this is not my personal computer.

Just so you know, I use SciTE or Notepad++, but neither converts the files correctly. For example, if I open a Czech document that contains the character "ž" and choose the "Convert to UTF-8" option in Notepad++, the character turns into something unreadable.

There is a way I CAN convert them, but it is tedious: if I open a document containing the special characters, copy its contents to the Windows clipboard, paste them into a new UTF-8 document, and save it, the result is fine. Opening every file and copying and pasting it into a new document is not practical for the number of documents I have.

Any ideas? Thanks!!!
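
For illustration, here is a minimal sketch of what an automated pass could look like in Perl, using the core Encode module. The helper name `to_utf8` and its `$fallback` parameter are hypothetical, not from any existing tool. The idea is a heuristic: bytes that decode cleanly as strict UTF-8 are treated as already converted, and everything else is assumed to be in the legacy encoding. Note that "ž" does not exist in ISO-8859-1 proper, so the Czech files may really be Windows-1252, Windows-1250, or ISO-8859-2; that is an assumption to check against a sample file, and the fallback name can be changed accordingly (for example `cp1250`).

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical helper: return $bytes re-encoded as UTF-8 without a BOM.
# $fallback is the assumed legacy encoding for files that are not UTF-8.
sub to_utf8 {
    my ($bytes, $fallback) = @_;
    $fallback ||= 'ISO-8859-1';

    # Drop a UTF-8 BOM if one is present.
    $bytes =~ s/\A\xEF\xBB\xBF//;

    # Strict decode: dies on invalid UTF-8, which eval turns into undef.
    # LEAVE_SRC keeps $bytes unmodified so it can be decoded again below.
    my $text = eval {
        decode('UTF-8', $bytes, Encode::FB_CROAK | Encode::LEAVE_SRC);
    };

    # Not valid UTF-8, so assume the legacy encoding instead.
    $text = decode($fallback, $bytes) unless defined $text;

    return encode('UTF-8', $text);
}

# Convert every file named on the command line, overwriting it in place.
for my $path (@ARGV) {
    open my $in, '<:raw', $path or die "Cannot read $path: $!";
    my $bytes = do { local $/; <$in> };   # slurp the whole file as raw bytes
    close $in;

    open my $out, '>:raw', $path or die "Cannot write $path: $!";
    print {$out} to_utf8($bytes);
    close $out or die "Cannot close $path: $!";
}
```

Saved as, say, `convert.pl` (a made-up name), it would be run as `perl convert.pl file1.txt file2.txt`. Because it overwrites files, it is safer to try it on copies first. The Windows command prompt does not expand wildcards such as `*.txt`, so the files have to be listed explicitly or expanded inside the script with `glob`. The detection is only a heuristic: a legacy-encoded file whose bytes happen to form valid UTF-8 would be left unchanged.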

© Stack Overflow or respective owner