radixark/Megatron-LMPython
Ongoing research training transformer models at scale
(8 stars) (4 forks) (0 issues indexées) (0 good first issues ouvertes)
Dépôts
Ongoing research training transformer models at scale
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
The sglang router for miles only.