m-bain/whisperX

Convert arabic numerals and symbols to phonetic form

Open

#38 opened on 2023年1月25日

GitHub で見る
 (2 comments) (0 reactions) (0 assignees)Python (6,880 stars) (650 forks)batch import
help wanted

説明

Currently arabic numerals and symbols in whisper transcript cannot be aligned, needs to be phonetic alphabet.

Need to perform inverse of normalization in https://github.com/m-bain/whisperX/blob/main/whisperx/normalizers/english.py

Such that numbers and currencies are converted to their phonetic word form.

E.g. "$300" -> "three hundred dollars"

To perform wav2vec alignment.

Then convert back to symbol form, and assign timestamps.

コントリビューターガイド