Where can I download a free, text-rich dataset?

Posted by blee on Stack Overflow See other posts from Stack Overflow or by blee
Published on 2010-03-31T18:28:42Z Indexed on 2010/03/31 18:33 UTC
Read the original article Hit count: 584

Filed under:
|
|

I want to do a bit of lightweight testing and bench-marking for full-text search, so the dataset should have the qualities:

  • 10,000 - 100,000 records.
  • good dispersion of English words.
  • In CSV or Excel format--i.e. I don't want to access it via API.

Something like books or movies with title and description fields would be perfect. I browsed the UCI Machine Learning Repo, but it was too number-oriented. Thanks!

© Stack Overflow or respective owner

Related posts about data

Related posts about database