0 comments (0 comments)0 reactions (0 reactions)0 assignees (0 assignees)Python1,728 stars (1,728 stars)297 forks (297 forks)batch import
good first issuetf
Description
This can't be done now since algorithm does not own sampler.
Contributor guide
- Tech stack
- python
- Domain
- machine learning
- Issue type
- feature
- DifficultyEstimated implementation difficulty for a new contributor, from 1 for very small changes to 5 for expert-level work.
- 3
- Estimated timeA rough time range for an experienced contributor to investigate, implement, test, and prepare a pull request.
- 1-2 days
- Activity statusHow available the issue appears right now: fresh, active, stale, blocked, or waiting on maintainer input.
- stale
- ClarityHow clearly the issue explains the expected change, acceptance criteria, and next step.
- mostly clear
- Prerequisites
- Understanding of RL2 algorithmKnowledge of gym environment wrappingFamiliarity with garage's sampler module
- Newbie friendlinessA 1-100 score estimating how approachable this issue is for first-time contributors.
- 30
- Research direction
- Investigate the current relationship between the RL2 algorithm and the sampler in the garage codebase. Look at files such as garage/sampler/ and garage/algos/rl2.py to understand how env wrapping is handled. Determine if ownership of the sampler can be transferred to the algorithm or if a wrapper can be applied at a different level.