Python KMeans clustering words
Posted
by sadawd
on Stack Overflow
See other posts from Stack Overflow
or by sadawd
Published on 2010-03-17T03:29:50Z
Indexed on
2010/03/17
3:31 UTC
Read the original article
Hit count: 340
python
|clustering
Dear Everyone I am interested to perform kmeans clustering on a list of words with the distance measure being Leveshtein.
1) I know there are a lot of frameworks out there, including scipy and orange that has a kmeans implementation. However they all require some sort of vector as the data which doesn't really fit me.
2) I need a good clustering implementation. I looked at python-clustering and realize that it doesn't a) return the sum of all the distance to each centroid, and b) it doesn't have any sort of iteration limit or cut off which ensures the quality of the clustering. python-clustering and the clustering algorithm on daniweb doesn't really work for me.
Can someone find me a good lib? Google hasn't been my friend
© Stack Overflow or respective owner