scikit-learn/scikit-learn

Add Kmeans parameter for pruning small clusters

Open

#848 opened on May 10, 2012

View on GitHub
 (6 comments) (0 reactions) (0 assignees)Python (66,084 stars) (27,020 forks)batch import
ModerateNew Featurehelp wantedmodule:cluster

Description

In Kmeans, often some clusters have only very little data. This might happen for all random initializations. For this case, I would like to have an option to set a minimum cluster size, after which a cluster is dropped and a new one is created.

Contributor guide