radixark/Megatron-LMPython
Ongoing research training transformer models at scale
(8 Stars) (4 Forks) (0 indexierte Issues) (0 offene good first issues)
Repositories
Ongoing research training transformer models at scale
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
The sglang router for miles only.