Multi-decoder setup for multi-task learning · OpenNMT/OpenNMT-tf#130

(5 comments) (0 reactions) (0 assignees)Python (394 forks)batch import

enhancementhelp wanted

Repository metrics

Stars: (1,428 stars)
PR merge metrics: (30d に merged PR はありません)

説明

Similarly to the ParallelEncoder, a ParallelDecoder setup could allow multi-task learning. This should not be too hard to implement but we need to take care of some details:

support separate values for the decoding parameters (beam_width, length_penalty, etc.),
parts of SequenceToSequence assume a single output head (e.g. loss computation, reverse vocabulary lookup, exported outputs for model serving, etc. which should be moved in the decoder itself)

コントリビューターガイド

調査方針: 既存の `ParallelEncoder` 実装を参考に調査する。`SequenceToSequence` クラスに焦点を当て、損失計算や逆語彙検索など、単一の出力ヘッドを想定しているすべてのコンポーネントを特定する。各デコーダーごとに独立した beam width と length penalty をサポートする方法を理解するために、デコーダーパラメータの処理を確認する。関連する issue や PR をチェックし、以前の議論や部分的な実装がないか確認する。
技術スタック: pythontensorflow
領域: machine learningai
Issue 種別: 機能
難度: 4
推定時間: 1-2日
活動状況: 古い
明確さ: おおむね明確
前提条件: PythonTensorFlowsequence to sequence modelsOpenNMT tf codebase
初心者向け度: 30

Repository metrics

説明

コントリビューターガイド

新着 Easy issues をメールで受け取る。