ggml-org/whisper.cpp

tests : add WER benchmarks

Open

#2454 opened on Oct 5, 2024

(26 comments) (0 reactions) (0 assignees) C++ (49,693 stars) (5,535 forks)
help wanted, high priority, research🔬, roadmap

Description

It would be nice to start measuring the word error rate (WER) of whisper.cpp across a representative set of datasets:

  • short audio
  • long audio
  • English
  • non-English
  • etc.

This will help us catch regressions in the future. I'm not familiar with what is typically used for speech-to-text WER benchmarks, so I'm looking for help from the community.
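For reference, WER is usually computed as the word-level edit distance (substitutions + deletions + insertions) between a reference transcript and the model output, divided by the number of reference words. Below is a minimal Python sketch of that computation. It is not part of whisper.cpp: the function names (`normalize`, `edit_distance`, `corpus_wer`) are hypothetical, and the normalizer is a deliberately simple assumption (lowercasing and punctuation stripping). Reported WER numbers depend heavily on the normalizer used, so a real benchmark would need to pin one down.

```python
from typing import Iterable, List, Tuple


def normalize(text: str) -> List[str]:
    """Lowercase, replace punctuation with spaces, split on whitespace.

    Assumption: this simple normalizer is for illustration only.
    """
    cleaned = "".join(
        ch if ch.isalnum() or ch.isspace() or ch == "'" else " "
        for ch in text.lower()
    )
    return cleaned.split()


def edit_distance(ref: List[str], hyp: List[str]) -> int:
    """Word-level Levenshtein distance, keeping only two rows of the table."""
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def corpus_wer(pairs: Iterable[Tuple[str, str]]) -> float:
    """Corpus-level WER: total word errors / total reference words.

    Each pair is (reference transcript, hypothesis transcript).
    """
    errors = 0
    ref_words = 0
    for reference, hypothesis in pairs:
        ref = normalize(reference)
        hyp = normalize(hypothesis)
        errors += edit_distance(ref, hyp)
        ref_words += len(ref)
    if ref_words == 0:
        raise ValueError("no reference words to score against")
    return errors / ref_words
```

To track the categories listed above, one option is to call `corpus_wer` once per bucket (short audio, long audio, English, non-English) and compare each number against a stored baseline, so that a regression in one category is not hidden by the overall average.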
