Hadoop and Object Reuse, Why?
Posted by Andrew White on Programmers
Published on 2014-02-11T15:37:35Z
Indexed on 2014/06/11 21:39 UTC
In Hadoop, objects passed to reducers are reused. This is extremely surprising and hard to track down if you're not expecting it. Furthermore, the original tracker for this "feature" doesn't offer any evidence that this change actually improved performance (unless I missed it).
"It would speed up the system substantially if we reused the keys and values [...] but I think it is worth doing."
This seems completely counter to this very popular answer. Is there some credence to the Hadoop developer's claim? Is there something "special" about Hadoop that would invalidate the notion of object creation being cheap?
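For anyone who hasn't hit this yet, here is a minimal sketch of the surprise I mean. It is plain Java with no Hadoop dependency: `MutableText` and `reusingValues` are hypothetical stand-ins I made up to imitate a Writable value and an iterable that refills one shared instance on every `next()`, which is how I understand the reducer's values to behave. It is not Hadoop's actual code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

public class ObjectReuseDemo {

    // Hypothetical stand-in for a mutable, Writable-style value.
    static final class MutableText {
        private String value = "";
        void set(String v) { value = v; }
        String get() { return value; }
        MutableText copy() {
            MutableText c = new MutableText();
            c.set(value);
            return c;
        }
    }

    // Imitates the reuse: every call to next() overwrites and returns
    // the same shared instance instead of allocating a new one.
    static Iterable<MutableText> reusingValues(List<String> raw) {
        MutableText shared = new MutableText();
        return () -> new Iterator<MutableText>() {
            private int i = 0;
            @Override public boolean hasNext() { return i < raw.size(); }
            @Override public MutableText next() {
                if (!hasNext()) throw new NoSuchElementException();
                shared.set(raw.get(i++));
                return shared;
            }
        };
    }

    // Buggy: stores references, so every list element is the same object.
    static List<String> collectByReference(Iterable<MutableText> values) {
        List<MutableText> kept = new ArrayList<>();
        for (MutableText v : values) kept.add(v);
        return render(kept);
    }

    // Safe: takes a defensive copy before holding on to the value.
    static List<String> collectByCopy(Iterable<MutableText> values) {
        List<MutableText> kept = new ArrayList<>();
        for (MutableText v : values) kept.add(v.copy());
        return render(kept);
    }

    private static List<String> render(List<MutableText> kept) {
        List<String> out = new ArrayList<>();
        for (MutableText t : kept) out.add(t.get());
        return out;
    }

    public static void main(String[] args) {
        List<String> raw = Arrays.asList("a", "b", "c");
        System.out.println(collectByReference(reusingValues(raw))); // [c, c, c]
        System.out.println(collectByCopy(reusingValues(raw)));      // [a, b, c]
    }
}
```

In a real reducer the equivalent of `copy()` would be something like constructing a new `Text` from the incoming one before adding it to a collection. The sketch only shows the aliasing problem; it says nothing about whether the reuse actually pays off in performance, which is the part I'm asking about.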
© Programmers or respective owner