m-bain/whisperX

Convert arabic numerals and symbols to phonetic form

Open

#38 建立於 2023年1月25日

在 GitHub 查看
 (2 留言) (0 反應) (0 負責人)Python (6,880 star) (650 fork)batch import
help wanted

描述

Currently arabic numerals and symbols in whisper transcript cannot be aligned, needs to be phonetic alphabet.

Need to perform inverse of normalization in https://github.com/m-bain/whisperX/blob/main/whisperx/normalizers/english.py

Such that numbers and currencies are converted to their phonetic word form.

E.g. "$300" -> "three hundred dollars"

To perform wav2vec alignment.

Then convert back to symbol form, and assign timestamps.

貢獻者指南