facebookresearch/fairseq

Add an option to disable appending eos to the end of each document while preprocessing

Open

#5,615 opened on May 26, 2025

View on GitHub
 (0 comments) (0 reactions) (0 assignees)Python (29,107 stars) (6,224 forks)batch import
enhancementhelp wantedneeds triage

Description

🚀 Feature Request

Add an option to disable appending EOS to the end of each document while preprocessing

Motivation

There are instances where adding EOS is not preferred like when packing data to sequence length.

Pitch

Proposing a change to add an argument to handle this

Contributor guide