DLR-RM/stable-baselines3

[Enhancement]: Wrong gains for weight initialization

Open

#1,559 建立於 2023年6月16日

在 GitHub 查看
 (2 留言) (0 反應) (1 負責人)Python (6,550 star) (1,407 fork)batch import
enhancementhelp wanted

描述

Enhancement

The recommended gains for the weight init depend on the used activation function, see torch docs. However, as for now the used gains are statically implemented and always the same in ActorCriticPolicies. See here.

I recommend making the gains dependent on the activation function used(, i.e. probably mainly ReLU and tanh).

If you agree with this, I would like to implement it myself and PR.

Thanks and a good day!

To Reproduce

--

Relevant log output / Error message

--

System Info

--

Checklist

  • I have checked that there is no similar issue in the repo
  • I have read the documentation
  • I have provided a minimal working example to reproduce the bug
  • I've used the markdown code blocks for both code and stack traces.

貢獻者指南

[Enhancement]: Wrong gains for weight initialization · DLR-RM/stable-baselines3#1559 | Good First Issue