Python KMeans clustering words

Posted by sadawd on Stack Overflow
Published on 2010-03-17T03:29:50Z Indexed on 2010/03/17 3:31 UTC

Dear everyone, I would like to perform k-means clustering on a list of words, with Levenshtein distance as the distance measure.
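For reference, Levenshtein distance is the minimum number of single-character insertions, deletions, and substitutions needed to turn one word into another. A minimal pure-Python sketch follows; the function name `levenshtein` is only illustrative and is not taken from any particular library.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insert, delete, substitute; each costs 1)."""
    if len(a) < len(b):
        a, b = b, a  # iterate over the longer word, keep the DP row short

    # previous[j] holds the distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # substitute, or free if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("kitten", "sitting"))
```

This runs in time proportional to `len(a) * len(b)`, which is usually acceptable for short words but can become slow when every pair in a large word list has to be compared.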

1) I know there are a lot of frameworks out there, including SciPy and Orange, that have a k-means implementation. However, they all require the data to be some sort of vector, which doesn't really fit my case.

2) I need a good clustering implementation. I looked at python-clustering and realized that a) it doesn't return the sum of all the distances to each centroid, and b) it doesn't have any sort of iteration limit or cutoff to ensure the quality of the clustering. Neither python-clustering nor the clustering algorithm on DaniWeb really works for me.
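To make requirements (a) and (b) concrete, here is a rough sketch of the kind of interface I have in mind. Note that it is k-medoids rather than true k-means: a mean of strings is not defined under edit distance, so each cluster center here is an actual word from the list. All names (`k_medoids`, `assign`, `max_iter`, `seed`) are hypothetical and are not taken from python-clustering, SciPy, or Orange.

```python
import random


def levenshtein(a, b):
    """Plain dynamic-programming edit distance (unit costs)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def assign(words, medoids, dist):
    """Attach each word to its nearest medoid; return (clusters, total cost)."""
    clusters = {m: [] for m in medoids}
    cost = 0
    for w in words:
        d, m = min((dist(w, m), m) for m in medoids)
        clusters[m].append(w)
        cost += d
    return clusters, cost


def k_medoids(words, k, dist=levenshtein, max_iter=100, seed=0):
    """Return ({medoid: members}, sum of distances to the assigned medoids).

    Assumes dist(x, y) == 0 only when x == y (true for Levenshtein),
    so that no cluster ever ends up empty.
    """
    words = list(dict.fromkeys(words))  # drop duplicates, keep order
    if not 1 <= k <= len(words):
        raise ValueError("k must be between 1 and the number of distinct words")

    medoids = random.Random(seed).sample(words, k)
    clusters, cost = assign(words, medoids, dist)

    for _ in range(max_iter):  # (b) hard iteration limit
        # New medoid of a cluster: the member with the smallest total
        # distance to the other members.
        new_medoids = [
            min(members, key=lambda c: sum(dist(c, w) for w in members))
            for members in clusters.values()
        ]
        new_clusters, new_cost = assign(words, new_medoids, dist)
        if new_cost >= cost:  # cutoff: stop once the cost stops improving
            break
        clusters, cost = new_clusters, new_cost

    return clusters, cost  # (a) the total distance is returned to the caller


words = ["apple", "apples", "appel", "banana", "bananas", "bannana"]
clusters, total = k_medoids(words, k=2)
print(clusters, total)
```

Like k-means, this only finds a local optimum, so the result depends on the random starting medoids; running it with several `seed` values and keeping the lowest total is one way to use the returned cost.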

Can someone point me to a good library? Google hasn't been my friend.

© Stack Overflow or respective owner
