ggml-org/whisper.cpp

tests : add WER benchmarks

Open

#2.454 aberto em 5 de out. de 2024

Ver no GitHub
 (26 comments) (0 reactions) (0 assignees)C++ (49.693 stars) (5.535 forks)batch import
help wantedhigh priorityresearch🔬roadmap

Description

It would be nice to start measuring the word error rate (WER) of whisper.cpp across some representative dataset:

  • short audio
  • long audio
  • english
  • non-english
  • etc.

This will help us catch regressions in the future. I'm not familiar with what is typically used for TTS WER benchmarks, so looking for help from the community.

Guia do colaborador