radixark/Megatron-LMPython
Ongoing research training transformer models at scale
(8 stars) (4 forks) (0 issues indexadas) (0 good first issues abertas)
Repositórios
Ongoing research training transformer models at scale
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
The sglang router for miles only.