Hadoop 0.2: How to read outputs from TextOutputFormat?
Posted
by S.N
on Stack Overflow
See other posts from Stack Overflow
or by S.N
Published on 2010-04-09T08:46:42Z
Indexed on
2010/04/13
1:02 UTC
Read the original article
Hit count: 438
My reducer class produces outputs with TextOutputFormat (the default OutputFormat given by Job). I like to consume this outputs after the MapReduce job complete to aggregate the outputs. In addition to this, I like to write out the aggregated information with TextInputFormat so that the output from this process can be consumed by the next iteration of MapReduce task. Can anyone give me an example on how to write & read with TextFormat? By the way, the reason why I am using TextFormat, rather Sequence, is the interoperability. The outputs should be consumed by any software.
© Stack Overflow or respective owner