Splitting input into substrings in PIG (Hadoop)
Posted by Niels Basjes on Stack Overflow
Published on 2009-09-09T14:42:17Z
Assume I have the following input in Pig:
some
And I would like to convert that into:
s
so
som
some
I have not (yet) found a way to iterate over a chararray in Pig Latin. I have found the TOKENIZE function, but that splits on word boundaries. Can Pig Latin do this, or does it require a Java class?
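To make the desired transformation concrete, here is a minimal sketch of the prefix expansion in plain Java. It only shows the string logic; the class and method names (`PrefixExpander`, `prefixes`) are hypothetical, and wrapping this in a Pig user-defined function (for example by extending Pig's `EvalFunc`) is an assumption that is not shown here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Pig: expands a string into all of its
// leading substrings, shortest first.
final class PrefixExpander {

    // Returns every prefix of the input, from the first character up to
    // the full string. A null or empty input yields an empty list.
    static List<String> prefixes(String input) {
        List<String> result = new ArrayList<>();
        if (input == null) {
            return result;
        }
        for (int end = 1; end <= input.length(); end++) {
            result.add(input.substring(0, end));
        }
        return result;
    }

    public static void main(String[] args) {
        // Prints one prefix per line for the example input.
        for (String prefix : prefixes("some")) {
            System.out.println(prefix);
        }
    }
}
```

For the input `some`, this produces `s`, `so`, `som`, `some`, which is the output described above.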