DLR-RM/stable-baselines3
在 GitHub 查看[Feature Request] independently configurable learning rates for actor and critic
Open
#338 创建于 2021年3月3日
enhancementhelp wanted
描述
🚀 Feature
independently configurable learning rates for actor and critic in AC-style algorithms
Motivation
In literature the actor is often configured to learn slower, such that the critics responses are more reliable. At least it would be nice if i could allow my hyperparameter optimizer to decide which learning rates he wants to use for actor or critic.
Pitch
Additional context
https://spinningup.openai.com/en/latest/algorithms/ddpg.html#documentation-pytorch-version