Where can I download a free, text-rich dataset?
Posted
by blee
on Stack Overflow
See other posts from Stack Overflow
or by blee
Published on 2010-03-31T18:28:42Z
Indexed on
2010/03/31
18:33 UTC
Read the original article
Hit count: 582
I want to do a bit of lightweight testing and bench-marking for full-text search, so the dataset should have the qualities:
- 10,000 - 100,000 records.
- good dispersion of English words.
- In CSV or Excel format--i.e. I don't want to access it via API.
Something like books or movies with title and description fields would be perfect. I browsed the UCI Machine Learning Repo, but it was too number-oriented. Thanks!
© Stack Overflow or respective owner