Where can I get the 10k common English dictionary words that Stack Overflow uses for related questions? [migrated]
- by itpian.com
Where can I get the list of the 10,000 most common English dictionary words that Stack Overflow removes before running its "related questions" query?
It is mentioned in this SE podcast post:
http://blog.stackoverflow.com/2008/12/podcast-32/
One of our major performance optimizations for the “related questions”
query is removing the top 10,000 most common English dictionary words
(as determined by Google search) before submitting the query to the
SQL Server 2008 full text engine. It’s shocking how little is left of
most posts once you remove the top 10k English dictionary words. This
helps limit and narrow the returned results, which makes the query
dramatically faster.
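For illustration, here is a minimal Python sketch of the filtering step the quote describes: drop the common words from a post and keep only the remaining terms for the full-text query. It is not Stack Overflow's actual code. The function names, the tokenization rule, and the idea of a word list with one word per line sorted by frequency are all assumptions, and the sketch does not say where the 10k list itself comes from.

```python
import re


def load_common_words(lines, limit=10_000):
    """Build a set from the first `limit` distinct words of a frequency list.

    `lines` is any iterable of strings, one word per line (hypothetical
    format), most common words first.
    """
    words = set()
    for line in lines:
        word = line.strip().lower()
        if word:
            words.add(word)
        if len(words) >= limit:
            break
    return words


def rare_terms(text, common_words):
    """Return the distinct tokens of `text` that are not common words."""
    seen = set()
    terms = []
    # Simple tokenization (an assumption): runs of letters, digits, apostrophes.
    for token in re.findall(r"[a-z0-9']+", text.lower()):
        if token not in common_words and token not in seen:
            seen.add(token)
            terms.append(token)
    return terms


if __name__ == "__main__":
    # Tiny stand-in list; a real one would hold about 10,000 entries.
    common = load_common_words(["how", "do", "i", "parse", "in"])
    print(rare_terms("How do I parse JSON in PowerShell?", common))
```

With the tiny stand-in list above, only "json" and "powershell" survive, which matches the quote's point that little is left of a post once common words are removed. The surviving terms are what would be passed on to the full-text engine.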