Hadoop/Pig Cross-join

Posted by sagie on Stack Overflow See other posts from Stack Overflow or by sagie
Published on 2010-01-28T12:19:48Z Indexed on 2010/03/30 14:03 UTC
Read the original article Hit count: 642

Filed under:
|
|

Hi I am using Pig to cross join two data sets, both with format. This will result as . In my case, if I have both tuples and , it is a duplication. Can I filter those duplications (or not joining them at all)?

thanks, Sagie

© Stack Overflow or respective owner

Related posts about pig

Related posts about hadoop