Splitting input into substrings in PIG (Hadoop)

Posted by Niels Basjes on Stack Overflow See other posts from Stack Overflow or by Niels Basjes
Published on 2009-09-09T14:42:17Z Indexed on 2010/06/17 16:43 UTC
Read the original article Hit count: 348

Filed under:
|
|
|

Assume I have the following input in Pig:

some

And I would like to convert that into:

s
so
som
some

I've not (yet) found a way to iterate over a chararray in pig latin. I have found the TOKENIZE function but that splits on word boundries. So can "pig latin" do this or is this something that requires a Java class to do that?

© Stack Overflow or respective owner

Related posts about hadoop

Related posts about mapreduce