ggml-org/whisper.cpp

tests : add WER benchmarks

Open

Opened on Oct 5, 2024

26 comments · 0 reactions · 0 assignees · C++ · 49,693 stars · 5,535 forks
Labels: help wanted, high priority, research🔬, roadmap

Description

It would be nice to start measuring the word error rate (WER) of whisper.cpp across some representative dataset:

  • short audio
  • long audio
  • English
  • non-English
  • etc.

This will help us catch regressions in the future. I'm not familiar with what is typically used for speech-to-text (ASR) WER benchmarks, so I'm looking for help from the community.
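
For reference, WER is usually defined as the word-level edit distance (substitutions + deletions + insertions) between the reference transcript and the model output, divided by the number of words in the reference. Below is a minimal Python sketch of that definition. The function name `word_error_rate` and the normalization (lowercasing and whitespace splitting only) are assumptions made for illustration, not something whisper.cpp provides; a real benchmark would likely also need punctuation and number normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; normalization here is only
    lowercasing and whitespace tokenization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of four reference words.
    print(word_error_rate("the quick brown fox", "the quick fox"))
```

Note that WER can exceed 1.0 when the hypothesis contains many inserted words, so a regression check would probably compare against a stored per-dataset baseline rather than a fixed threshold.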
