m-bain/whisperX

Convert arabic numerals and symbols to phonetic form

Open

#38 创建于 2023年1月25日

在 GitHub 查看
 (2 评论) (0 反应) (0 负责人)Python (6,880 star) (650 fork)batch import
help wanted

描述

Currently arabic numerals and symbols in whisper transcript cannot be aligned, needs to be phonetic alphabet.

Need to perform inverse of normalization in https://github.com/m-bain/whisperX/blob/main/whisperx/normalizers/english.py

Such that numbers and currencies are converted to their phonetic word form.

E.g. "$300" -> "three hundred dollars"

To perform wav2vec alignment.

Then convert back to symbol form, and assign timestamps.

贡献者指南

Convert arabic numerals and symbols to phonetic form · m-bain/whisperX#38 | Good First Issue