scikit-learn/scikit-learn

[WIP] Online Inference for BayesianGaussianMixture

Open

#8,714 opened on 2017年4月6日

GitHub で見る
 (5 comments) (0 reactions) (0 assignees)Python (66,084 stars) (27,020 forks)batch import
Enhancementhelp wantedmodule:mixture

説明

It would be great if someone implemented partial_fit for BayesianGaussianMixture using mean field stochastic variational inference. Fitting with EM is way too slow to use for online clustering on large datasets, and right now the only online clustering method in sklearn is MiniBatchKMeans, which doesn't handle noisy data very well. Alternatively, a parallel Gibbs sampler would work too but is likely a lot more work.

コントリビューターガイド