Which Hadoop API version should I use?
- by Niels Basjes
In the latest Hadoop Studio the 0.18 API of Hadoop is called "Stable" and the 0.20 API of Hadoop is called "Unstable".
The distribution that comes from Yahoo is a 0.20 (with yahoo patches), which is apparently "the way to go".
From cloudera they state the 0.20 (with cloudera patches) is also stable.
Now given the fact that we'll start coding a new Hadoop project in the next few weeks; which API should we use and which Hadoop distribution (Apache, Cloudera, Yahoo, ...) should we use?
Thanks for your insights.