This can't be done now since the algorithm does not own the sampler.
Contributor guide
Tech stack
python
Domain
machine learning
Issue type
feature
Difficulty (estimated implementation difficulty for new contributors; 1 is a very small change, 5 is expert-level work)
3
Estimated time (rough time range for an experienced contributor to investigate, implement, test, and prepare a pull request)
1-2 days
Activity status (how open the issue currently is to participation: fresh, active, stale, blocked, or awaiting maintainer input)
stale
Clarity (whether the issue clearly states the expected change, acceptance criteria, and next steps)
mostly clear
Prerequisites
Understanding of the RL2 algorithm
Knowledge of gym environment wrapping
Familiarity with garage's sampler module
Beginner friendliness (estimated score from 1 to 100 indicating how approachable the issue is for first-time contributors)
30
Research directions
Investigate the current relationship between the RL2 algorithm and the sampler in the garage codebase. Look at paths such as garage/sampler/ and garage/algos/rl2.py to understand how env wrapping is handled. Determine whether ownership of the sampler can be transferred to the algorithm, or whether the wrapper can be applied at a different level.
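To make the ownership question concrete, here is a minimal, self-contained sketch of the first option: an algorithm that constructs its own sampler and can therefore wrap the env before any sampling happens. It assumes the wrapping in question is RL2-style observation augmentation (appending the previous action, reward, and done flag to each observation). `ToyEnv`, `RL2StyleWrapper`, `ToySampler`, and `ToyAlgo` are hypothetical names for illustration only; none of them are garage classes, and the real sampler and env interfaces may differ.

```python
class ToyEnv:
    """Hypothetical stand-in for an environment; not a garage class."""

    def reset(self):
        return [0.0, 0.0]

    def step(self, action):
        # Returns (observation, reward, done).
        return [1.0, 1.0], 1.0, False


class RL2StyleWrapper:
    """Appends the previous action, reward, and done flag to each observation."""

    def __init__(self, env):
        self._env = env

    def reset(self):
        # No previous step exists yet, so pad with zeros.
        return self._env.reset() + [0.0, 0.0, 0.0]

    def step(self, action):
        obs, reward, done = self._env.step(action)
        return obs + [float(action), reward, float(done)], reward, done


class ToySampler:
    """Hypothetical sampler that collects one rollout from a single env."""

    def __init__(self, env):
        self.env = env

    def rollout(self, policy, n_steps):
        obs = self.env.reset()
        observations = [obs]
        for _ in range(n_steps):
            obs, _, done = self.env.step(policy(obs))
            observations.append(obs)
            if done:
                break
        return observations


class ToyAlgo:
    """Owns its sampler, so it can wrap the env before sampling starts."""

    def __init__(self, env, sampler_cls=ToySampler):
        # Assumption: the algorithm, not the caller, builds the sampler.
        self.sampler = sampler_cls(RL2StyleWrapper(env))


algo = ToyAlgo(ToyEnv())
batch = algo.sampler.rollout(policy=lambda obs: 1.0, n_steps=3)
print(len(batch), len(batch[0]))
```

If the sampler were instead built outside the algorithm and handed an unwrapped env, `ToyAlgo` would have no point at which to apply the wrapper, which is the limitation the comment at the top describes.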