Ruby : UTF-8 IO
- by subtenante
I use ruby 1.8.7.
I try to parse some text files containing greek sentences, encoded in UTF-8.
(I can't much paste here sample files, because they are subject to copyright. Really just some greek text encoded in UTF-8.)
I want, for each file, to parse the file, extract all the words, and make a list of each new word found in this file. All…