Free large datasets to experiment with Hadoop
Posted by Sundar on Stack Overflow
Published on 2010-04-20T10:54:11Z
Do you know of any large datasets, free or low cost, to experiment with Hadoop? Any pointers or links would be appreciated.
Preference:
At least one GB of data.
Production log data from a web server.
A few that I have found so far:
Also, can we run our own crawler to gather data from sites such as Wikipedia? Any pointers on how to do this would be appreciated as well.
© Stack Overflow or respective owner