[Enhancement]: Wrong gains for weight initialization · DLR-RM/stable-baselines3#1559

2 comments · 0 reactions · 1 assignee · Python · 1,407 forks

Labels: enhancement, help wanted

Repository metrics

Stars: 6,550
PR merge metrics: average time to merge 11d 13h; 3 PRs merged in the last 30d

Description

Enhancement

The recommended gain for weight initialization depends on the activation function used (see the torch docs). However, the gains in ActorCriticPolicies are currently hard-coded and always the same, regardless of the activation function. See here.

I recommend making the gains depend on the activation function used (probably mainly ReLU and tanh).
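As a rough illustration of the idea, the sketch below maps an activation class to an initialization gain. It is not the existing stable-baselines3 code: `gain_for_activation` and `_RECOMMENDED_GAINS` are hypothetical names, the lookup by class name is an assumption made to keep the example free of dependencies, and the `sqrt(2)` fallback for activations outside the table is an arbitrary choice. The gain values are the ones listed in the documentation of `torch.nn.init.calculate_gain`.

```python
import math

# Gains as listed in the torch.nn.init.calculate_gain documentation,
# keyed by the class name of the activation module (e.g. nn.Tanh -> "Tanh").
_RECOMMENDED_GAINS = {
    "Identity": 1.0,
    "Sigmoid": 1.0,
    "Tanh": 5.0 / 3.0,
    "ReLU": math.sqrt(2.0),
    "SELU": 3.0 / 4.0,
}


def gain_for_activation(activation_fn, default=math.sqrt(2.0), negative_slope=0.01):
    """Return an init gain for an activation class.

    Hypothetical helper for illustration only; not part of any library API.
    `default` is used for activations that are not in the table (assumption).
    """
    name = activation_fn.__name__
    if name == "LeakyReLU":
        # Leaky ReLU gain depends on the negative slope.
        return math.sqrt(2.0 / (1.0 + negative_slope ** 2))
    return _RECOMMENDED_GAINS.get(name, default)
```

With PyTorch installed, one would presumably pass the policy's activation class (for example `nn.Tanh` or `nn.ReLU`) and hand the result to an initializer such as `nn.init.orthogonal_(module.weight, gain=gain)`. Which layers should use the activation-dependent gain is a design question left to the actual PR.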

If you agree with this, I would like to implement it myself and PR.

Thanks and a good day!

To Reproduce

Relevant log output / Error message

--

System Info

Checklist

I have checked that there is no similar issue in the repo
I have read the documentation
I have provided a minimal working example to reproduce the bug
I've used the markdown code blocks for both code and stack traces.

Contributor guide

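Investigation approach: Examine the current weight initialization in ActorCriticPolicies, identify where the gains are set statically, and change it to use activation-function-specific gains as described in the PyTorch documentation.
Tech stack: python, pytorch
Area: machine learning, ai
Issue type: feature
Difficulty: 2
Estimated time: 1-3 hours
Activity: active
Clarity: clear
Prerequisites: Python, PyTorch, Git
Beginner-friendliness: 75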