radixark/Megatron-LMPython
Ongoing research training transformer models at scale
(8 stars) (4 forks) (0 個已索引 issue) (0 個開放 good first issue)
倉庫
Ongoing research training transformer models at scale
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
The sglang router for miles only.