PaddlePaddle/PaddleSpeech

[TTS] Try to train a universial GAN Vocoder using CSMSC + LJSpeech + AISHELL3 + VCTK

Open

#2,803 opened on 2023年1月6日

GitHub で見る
 (0 comments) (1 reaction) (1 assignee)Python (9,453 stars) (1,702 forks)batch import
T2Sfeature requestgood first issue

説明

An universial GAN Vocoder may works well for all AMs of different datasets, for example, CSMSC is a single female dataset, may generate bad wavs for mels of male speakers, cause different genders have different distribution of speech features.

Please try to train a universial GAN Vocoder using CSMSC + LJSpeech + AISHELL3 + VCTK + some other TTS datasets (if you want) with the config of CSMSC (24kHz).

LJSpeech is 22.05kHz, but you don't need to resample it yourself, cause we will resample the wavs to the sample rate setted in config file in preprocess stage ~

コントリビューターガイド