A Ruby library that implements clustering algorithms for text mining.
The currently implemented algorithms are K-Means and Hierarchical clustering.
Hierarchical clustering gives better results, but its complexity is roughly O(n*n), where n is the number of items to cluster.
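To make the quadratic cost concrete, here is a minimal sketch of single-link agglomerative clustering. It is not this library's API: agglomerative_sketch, single_link, and distance are hypothetical names, and the small numeric vectors stand in for document feature vectors. The n*n term shows up in the pairwise distance table; the naive merge loop is written for readability rather than speed.

```ruby
# Euclidean distance between two equal-length numeric vectors.
def distance(a, b)
  Math.sqrt(a.zip(b).sum { |x, y| (x - y)**2 })
end

# Single-link distance between two clusters (arrays of point indices),
# looked up in a precomputed pairwise distance table.
def single_link(c1, c2, dist)
  c1.product(c2).map { |i, j| dist[i][j] }.min
end

# Hypothetical helper: merge the two closest clusters until only
# `target` clusters remain. Returns sorted arrays of point indices.
def agglomerative_sketch(points, target)
  n = points.size
  # n * n pairwise distances: this table is the quadratic part.
  dist = Array.new(n) { |i| Array.new(n) { |j| distance(points[i], points[j]) } }
  clusters = (0...n).map { |i| [i] }

  while clusters.size > target
    # Pick the pair of clusters with the smallest single-link distance.
    a, b = clusters.combination(2).min_by { |c1, c2| single_link(c1, c2, dist) }
    clusters.delete(a)
    clusters.delete(b)
    clusters << (a + b).sort
  end
  clusters.sort
end

# Stand-ins for document vectors (assumed data, two obvious groups).
hier_points = [[1.0, 1.0], [5.0, 5.0], [1.2, 0.8], [5.2, 4.9]]
hier_clusters = agglomerative_sketch(hier_points, 2)
# hier_clusters => [[0, 2], [1, 3]]
```

Single-link is only one possible merge rule; the sketch assumes it for brevity and says nothing about which linkage the library itself uses.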
K-Means is very fast: O(k*n*i) for k clusters, n items, and i iterations.
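The O(k*n*i) figure can be read directly off a plain K-Means loop. The sketch below is an illustration only, not this library's API: kmeans_sketch, squared_distance, and centroid are hypothetical names, the numeric vectors stand in for document feature vectors, and seeding with the first k points is an assumption made to keep the run deterministic.

```ruby
# Squared Euclidean distance between two equal-length numeric vectors.
def squared_distance(a, b)
  a.zip(b).sum { |x, y| (x - y)**2 }
end

# Component-wise mean of a non-empty list of vectors.
def centroid(points)
  points.transpose.map { |column| column.sum.to_f / column.size }
end

# Hypothetical helper: plain K-Means.
# Returns [assignments, centroids, iterations].
def kmeans_sketch(points, k, max_iterations: 100)
  # Assumed seeding: the first k points become the initial centroids.
  centroids = points.first(k).map { |p| p.map(&:to_f) }
  assignments = nil
  iterations = 0

  max_iterations.times do
    iterations += 1
    # Assignment step: n points times k centroids per iteration.
    new_assignments = points.map do |point|
      (0...k).min_by { |j| squared_distance(point, centroids[j]) }
    end
    break if new_assignments == assignments # no change, so converged

    assignments = new_assignments
    # Update step: move each centroid to the mean of its members.
    k.times do |j|
      members = points.each_index.select { |i| assignments[i] == j }
                      .map { |i| points[i] }
      centroids[j] = centroid(members) unless members.empty?
    end
  end

  [assignments, centroids, iterations]
end

# Stand-ins for document vectors (assumed data, two obvious groups).
points = [[1.0, 1.0], [5.0, 5.0], [1.2, 0.8], [5.2, 4.9]]
assignments, centroids, iterations = kmeans_sketch(points, 2)
# assignments => [0, 1, 0, 1]
```

Each pass does k*n distance computations, and the loop runs i times until the assignments stop changing, which is where the k*n*i bound comes from.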
The examples need Google/Yahoo API keys, and the Yahoo example requires ysearch-rb from
developer.yahoo.com/download/download.html
Hybrid clustering algorithms, more similarity metrics, and semi-supervised clustering are coming soon (submit patches or show keen interest if you want faster results).
Happy hacking.