facebookresearch/fairseq

Add an option to disable appending eos to the end of each document while preprocessing

Open

#5,615 创建于 2025年5月26日

在 GitHub 查看
 (0 评论) (0 反应) (0 负责人)Python (29,107 star) (6,224 fork)batch import
enhancementhelp wantedneeds triage

描述

🚀 Feature Request

Add an option to disable appending EOS to the end of each document while preprocessing

Motivation

There are instances where adding EOS is not preferred like when packing data to sequence length.

Pitch

Proposing a change to add an argument to handle this

贡献者指南

Add an option to disable appending eos to the end of each document while preprocessing · facebookresearch/fairseq#5615 | Good First Issue