facebookresearch/fairseq

Evaluating TTS using STT

Open

#5,629 opened on Jul 19, 2025

enhancement, help wanted, needs triage

Description

I tried to recover the text from speech I generated with this model, but the transcription was a bit off. An automated round-trip test could help track quality while the model is being improved.

Basically, "some text" -> speech output -> speech input -> text output -> check if text output is "some text"
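
To make the idea concrete, here is a minimal sketch of such a round-trip check, scored with word error rate (WER) instead of an exact string match, since small differences in casing and punctuation are expected. The `synthesize` and `transcribe` arguments are hypothetical placeholders for whatever TTS and STT models get plugged in; they are not existing fairseq functions, and the normalization rule is only an assumption.

```python
import re
from typing import Any, Callable, Iterable, List


def normalize(text: str) -> List[str]:
    """Lowercase, replace punctuation with spaces, and split into words."""
    return re.sub(r"[^\w\s']", " ", text.lower()).split()


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = normalize(reference), normalize(hypothesis)
    if not ref:
        return 0.0 if not hyp else 1.0
    # Levenshtein distance over words, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, substitution))
        prev = curr
    return prev[-1] / len(ref)


def round_trip_wer(
    texts: Iterable[str],
    synthesize: Callable[[str], Any],   # hypothetical: text -> audio
    transcribe: Callable[[Any], str],   # hypothetical: audio -> text
) -> float:
    """Average WER of "text -> speech -> text" over a set of sentences."""
    scores = [word_error_rate(t, transcribe(synthesize(t))) for t in texts]
    if not scores:
        raise ValueError("texts must contain at least one sentence")
    return sum(scores) / len(scores)
```

A test could then assert that `round_trip_wer(...)` stays below some chosen threshold. Note that the score also includes the STT model's own errors, so it is only a rough proxy for TTS quality.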
