BonhamLab/microbiome2function
A modular Python toolkit for mining UniProt protein data and converting it into machine-learning-ready features. It covers the pipeline from HUMAnN outputs to clean numerical representations (embeddings and multi-hot encodings) for downstream ML models. Features include text cleaning, ESM-2 sequence embeddings, and GO/EC encoding.
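To make the multi-hot encoding idea concrete, here is a minimal sketch in plain Python. It is not the toolkit's own API: the function names `build_vocabulary` and `multi_hot` and the sample annotations are hypothetical, chosen only to show how a variable-length list of GO (or EC) labels per protein can become a fixed-width 0/1 vector.

```python
from typing import Dict, List, Sequence


def build_vocabulary(annotations: Sequence[Sequence[str]]) -> Dict[str, int]:
    """Map each distinct label to a column index (sorted for reproducibility)."""
    labels = sorted({label for row in annotations for label in row})
    return {label: index for index, label in enumerate(labels)}


def multi_hot(annotations: Sequence[Sequence[str]],
              vocab: Dict[str, int]) -> List[List[int]]:
    """Encode each protein's labels as a 0/1 vector over the vocabulary.

    Labels missing from the vocabulary are skipped rather than raising.
    """
    matrix = []
    for row in annotations:
        vector = [0] * len(vocab)
        for label in row:
            index = vocab.get(label)
            if index is not None:
                vector[index] = 1
        matrix.append(vector)
    return matrix


# Hypothetical GO annotations for three proteins (illustrative input only).
go_terms = [
    ["GO:0005524", "GO:0016301"],
    ["GO:0005524"],
    [],  # a protein with no annotations encodes to all zeros
]

vocab = build_vocabulary(go_terms)
encoded = multi_hot(go_terms, vocab)
print(vocab)
print(encoded)
```

The same approach would apply to EC numbers by passing EC strings instead of GO IDs. In practice the vocabulary would be built once on training data and reused, so that column positions stay stable across datasets.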
Details
Repository information