Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Repositories of lmcinnes
A deck.gl composite layer providing level of detail text support
A repository for public storage of slides given at the 17th Python in Science Conference (2018)
Tools for word and document embedding using UMAP
Benchmarks of approximate nearest neighbor libraries in Python
apricot implements submodular optimization for the purpose of selecting subsets of massive data sets to train machine learning models quickly. See the documentation page: https://apricot-select.readthedocs.io/en/latest/index.html
A flexible Bayesian approach to t-SNE dimension reduction.
Interactive Web Plotting for Python
A conda-smithy repository for conda-forge-pinning.
A collection of talks and tutorials from conferences I attend
Hosting examples of interactive datamapplot output
Quickly and accurately render even the largest data.
Just a bunch of useful embeddings
Ensemble topic modelling with pLSA
An ungodly union of GitHub and Figshare
Computations and statistics on manifolds with geometric structures.
Algorithmically create or extend categorical colour palettes
A high-performance implementation of HDBSCAN clustering. http://hdbscan.readthedocs.io/en/latest/
A conda-smithy repository for hdbscan.