CarperAI/trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

PythonStars 4730Forks 482Watchers 4730Open issues 102License MIT License
Details
仓库信息
OwnerCarperAI
Homepage
Last pushed2024-01-08
Last updated2025-12-14
Issues fetched at

Stats

Community at a glance

Loading...

Loading

--

Loading

--

Loading

--

Loading

--