Multi-decoder setup for multi-task learning · OpenNMT/OpenNMT-tf#130

(5 comments) (0 reactions) (0 assignees)Python (394 forks)batch import

enhancementhelp wanted

Repository metrics

Stars: (1,428 stars)
PR merge metrics: (No merged PRs in 30d)

Description

Similarly to the ParallelEncoder, a ParallelDecoder setup could allow multi-task learning. This should not be too hard to implement but we need to take care of some details:

support separate values for the decoding parameters (beam_width, length_penalty, etc.),
parts of SequenceToSequence assume a single output head (e.g. loss computation, reverse vocabulary lookup, exported outputs for model serving, etc. which should be moved in the decoder itself)

Contributor guide

Research direction: Study the existing ParallelEncoder implementation to understand the pattern, then create a ParallelDecoder class with separate decoding parameters. Refactor SequenceToSequence to handle multiple decoders for loss computation and vocabulary lookup.
Tech stack: pythontensorflow
Domain: aibackend
Issue type: Feature
Difficulty: 3
Estimated time: Half day
Activity status: Active
Clarity: Clear
Prerequisites: PythonTensorFlowGit
Newbie friendliness: 70

Repository metrics

Description

Contributor guide

Get fresh easy issues in your inbox.