K-Means

Kmeans is one of the most popular clustering algorithm. It is an unsupervised method that can partition the data into k clusters. Kmeans is based on the Euclidean metric, but it can use other metrics. It provides an intuitive way of splitting data into groups.

Our implementation offers several important features:

  • Fast kmeans implementation (for large k) based on trees
  • Smart initialization of the centroids with the kmeans++ algorithm
  • Automatic determination of optimal k based on BIC in batch mode and with the xmeans algorithm
  • Cross validation

Related articles:

Downloading

You can download it from here

Usage examples

Suggestions and bug reports

We would like to know your opinion about our product. Please send us bug reports and suggestions at support@analytics1305.com
If you didn't find an answer to your question, check the forum http://www.analytics1305.com/forum

Table Of Contents

Previous topic

Kernel Discriminant Analysis (KDA), using KDE for classification

Next topic

K-Means