Make it easier to use different reward functions in rl_tuner · magenta/magenta#375

1 comment · 1 reaction · 0 assignees · Python · 18,909 stars · 3,723 forks

help wanted

Description

The reward calculation code should be factored out into a separate class that implements an interface. This would make it easier for users to create their own reward implementations.
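One possible shape for such an interface is sketched below. Every name here (`RewardFunction`, `RepeatPenaltyReward`, `WeightedSumReward`, the `reward(composition, action)` signature) is hypothetical and chosen for illustration; none of it is taken from the existing rl_tuner code, and the real refactor would need to match whatever state the tuner actually passes around.

```python
from abc import ABC, abstractmethod
from typing import Iterable, Sequence, Tuple


class RewardFunction(ABC):
    """Hypothetical interface: score a candidate note given the notes so far."""

    @abstractmethod
    def reward(self, composition: Sequence[int], action: int) -> float:
        """Return the reward for appending `action` to `composition`."""


class RepeatPenaltyReward(RewardFunction):
    """Illustrative rule: penalize repeating the immediately preceding note."""

    def __init__(self, penalty: float = -1.0) -> None:
        self.penalty = penalty

    def reward(self, composition: Sequence[int], action: int) -> float:
        if composition and composition[-1] == action:
            return self.penalty
        return 0.0


class WeightedSumReward(RewardFunction):
    """Combine several reward functions as a weighted sum."""

    def __init__(self, components: Iterable[Tuple[float, RewardFunction]]) -> None:
        self.components = list(components)

    def reward(self, composition: Sequence[int], action: int) -> float:
        return sum(w * r.reward(composition, action) for w, r in self.components)
```

With this layout, a user-defined reward would only need to subclass `RewardFunction` and be passed to the tuner, for example `WeightedSumReward([(1.0, RepeatPenaltyReward()), (0.5, MyCustomReward())])`, where `MyCustomReward` stands in for the user's own class.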

Contributor guide

Tech stack: Python
Area: machine learning
Issue type: refactor
Difficulty: 3
Estimated time: half day
Activity: stale
Clarity: clear
Prerequisites: Python, basic OOP
Beginner-friendliness: 70
Investigation approach: Examine the current reward function implementation in the rl_tuner module, likely in files such as rl_tuner.py. Identify the reward calculation logic and propose a refactored class structure with a clear interface. Check any existing tests to make sure the refactor stays backward compatible.