Hadoop/Pig Cross-join
Posted
by sagie
on Stack Overflow
See other posts from Stack Overflow
or by sagie
Published on 2010-01-28T12:19:48Z
Indexed on
2010/03/30
14:03 UTC
Read the original article
Hit count: 642
Hi I am using Pig to cross join two data sets, both with format. This will result as . In my case, if I have both tuples and , it is a duplication. Can I filter those duplications (or not joining them at all)?
thanks, Sagie
© Stack Overflow or respective owner