lmcinnes/umap

Document reproducibility guarantees

Open

#298 ouverte le 25 sept. 2019

Voir sur GitHub
 (16 commentaires) (0 réactions) (0 assignés)Python (751 forks)batch import
0.4documentationgood first issue

Métriques du dépôt

Stars
 (6 478 stars)
Métriques de merge PR
 (Merge moyen 12h 3m) (15 PRs mergées en 30 j)

Description

Following up on the discussion here, it would be good to document how to get reproducible results with UMAP.

I think we should consider changing random_state in the UMAP constructor to a seed (e.g. 42, like the new transform_seed default) so that UMAP is reproducible by default.

We should document that users can set random_state to None to get faster results at the expense of reproducibility. In this mode there is no seed that would produce the same output due to the multithreading. (This was introduced in #294.)

Guide contributeur