HBase as a multimap

Posted by Ibrahim on Stack Overflow See other posts from Stack Overflow or by Ibrahim
Published on 2010-06-15T23:16:43Z Indexed on 2010/06/15 23:22 UTC
Read the original article Hit count: 239

Filed under:
|
|
|

Hi guys,

I'm doing some large scale text processing work and I'm trying to get started with Hadoop and HBase. One of the things I need to do is build a multimap of some stuff, which I later use to look up things and get all items with a certain key (in a M/R job). Would it be OK to use HBase and insert many rows with the same key and rely on versions/timestamps to achieve a multimap-like setup or is this a bad idea? The multimap is built up in the reduce phase of a Mapreduce task by the way, or at least in the way I've formulated it on paper. Thanks!

If more information is needed, I'd be happy to provide it. Not sure whether this question is clear.

© Stack Overflow or respective owner

Related posts about hadoop

Related posts about mapreduce