THUDM/slime

slime is an LLM post-training framework for RL Scaling.

PythonStars 2833Forks 328Watchers 2833Open issues 132License Apache License 2.0
Details
仓库信息
OwnerTHUDM
Last pushed2025-12-13
Last updated2025-12-14
Issues fetched at

Stats

Community at a glance

Loading...

Loading

--

Loading

--

Loading

--

Loading

--