YanCotta/post_training_llms

Post-training techniques for LLMs, including SFT, DPO, and online RL

Language: Python
Stars: 4
Forks: 1
Watchers: 4
Open issues: 4
License: MIT License
Details
Repository information
Owner: YanCotta
Homepage
Last pushed: 2025-09-05
Last updated: 2025-12-14