HumanCompatibleAI/population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards