HumanCompatibleAI/population-irl

(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards

PythonStars 27Forks 2Watchers 27Open issues 7License MIT License
Details
仓库信息
OwnerHumanCompatibleAI
Homepage
Last pushed2019-06-20
Last updated2025-12-15
Issues fetched at

Stats

Community at a glance

Loading...

Loading

--

Loading

--

Loading

--

Loading

--