CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)