ggml-org/whisper.cpp

tests : add WER benchmarks

Open

#2454 opened on Oct 5, 2024

Labels: help wanted, high priority, research🔬, roadmap

Description

It would be nice to start measuring the word error rate (WER) of whisper.cpp across a representative set of datasets:

  • short audio
  • long audio
  • English
  • non-English
  • etc.

This will help us catch regressions in the future. I'm not familiar with what is typically used for speech recognition (ASR) WER benchmarks, so I'm looking for help from the community.
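For reference, WER is usually defined as the word-level edit distance between the reference transcript and the model output (substitutions + deletions + insertions), divided by the number of words in the reference. Below is a minimal sketch of that computation, not a proposal for the actual test harness. The function names (`normalize`, `word_errors`, `corpus_wer`) are hypothetical and not part of whisper.cpp, and the text normalization is a deliberately simple assumption; a real benchmark would likely need more careful, language-aware normalization.

```python
import re


def normalize(text: str) -> list[str]:
    # Simplistic normalization (an assumption, not an established standard):
    # lowercase, replace anything that is not a word character, whitespace,
    # or an apostrophe with a space, then split on whitespace.
    text = text.lower()
    text = re.sub(r"[^\w\s']", " ", text)
    return text.split()


def word_errors(reference: str, hypothesis: str) -> tuple[int, int]:
    """Return (word-level edit distance, number of reference words)."""
    ref = normalize(reference)
    hyp = normalize(hypothesis)
    # prev[j] is the edit distance between the reference words processed so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from output
                curr[j - 1] + 1,     # insertion: extra word in output
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            )
        prev = curr
    return prev[len(hyp)], len(ref)


def corpus_wer(pairs) -> float:
    """Aggregate WER over (reference, hypothesis) pairs: total errors / total words."""
    total_errors = 0
    total_words = 0
    for reference, hypothesis in pairs:
        errors, words = word_errors(reference, hypothesis)
        total_errors += errors
        total_words += words
    if total_words == 0:
        raise ValueError("reference transcripts contain no words")
    return total_errors / total_words


if __name__ == "__main__":
    # Made-up example pairs, only to show the call shape.
    pairs = [
        ("The cat sat on the mat.", "the cat sit on mat"),
        ("Hello, world!", "hello world"),
    ]
    print(f"WER: {corpus_wer(pairs):.3f}")
```

Summing errors and reference words over the whole dataset before dividing keeps short clips from dominating the score, which matters if the benchmark mixes short and long audio.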
