scikit-learn/scikit-learn
Add Kmeans parameter for pruning small clusters
Open
#848 opened on May 10, 2012
Moderate, New Feature, help wanted, module:cluster
Description
In KMeans, some clusters often end up with very few data points, and this can happen for every random initialization. For this case, I would like to have an option to set a minimum cluster size: a cluster that falls below that size is dropped and a new one is created.
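To make the request concrete, here is a minimal sketch of one way the pruning step could work outside the estimator. Everything in it is an assumption for illustration: `reseed_small_clusters` and `min_size` are hypothetical names, not existing scikit-learn API, and reseeding from the points farthest from their assigned center is just one possible choice for "a new one is created".

```python
import numpy as np


def reseed_small_clusters(X, labels, centers, min_size):
    """Hypothetical helper, not part of scikit-learn.

    Replaces the center of every cluster with fewer than `min_size`
    points by a data point that is far from its currently assigned center.
    Returns the new centers and the indices of the clusters that were dropped.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    centers = np.array(centers, dtype=float)  # copy, the input is left untouched

    # Number of points assigned to each cluster (empty clusters count as 0).
    counts = np.bincount(labels, minlength=len(centers))
    small = np.flatnonzero(counts < min_size)
    if small.size == 0:
        return centers, small

    # Squared distance of every point to its own assigned center.
    dist = np.sum((X - centers[labels]) ** 2, axis=1)
    # Prefer points outside the dropped clusters as new seeds.
    dist[np.isin(labels, small)] = -np.inf

    # Take the farthest points, one per dropped cluster, as new centers.
    farthest = np.argsort(dist)[::-1][: small.size]
    centers[small] = X[farthest]
    return centers, small


X = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [50, 50]], dtype=float)
labels = np.array([0, 0, 0, 0, 1])
centers = np.array([[2.75, 2.75], [50.0, 50.0]])

new_centers, dropped = reseed_small_clusters(X, labels, centers, min_size=2)
```

The returned array could then be passed as the `init` argument of `KMeans` (with `n_init=1`) to refit from the reseeded centers, repeating until no cluster falls below the threshold or an iteration limit is reached. Whether such a loop belongs inside the estimator as a parameter is exactly the open question of this issue.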