PaddlePaddle/PaddleSpeech

[TTS] Try to train a universal GAN Vocoder using CSMSC + LJSpeech + AISHELL3 + VCTK

Open

#2803 opened on Jan 6, 2023

 (0 comments) (1 reaction) (1 assignee) Python (1702 forks)
Labels: T2S, feature request, good first issue


Description

A universal GAN Vocoder may work well for the AMs (acoustic models) of different datasets. For example, CSMSC is a single-speaker female dataset, so a vocoder trained only on it may generate bad wavs for mels of male speakers, because different genders have different distributions of speech features.

Please try to train a universal GAN Vocoder using CSMSC + LJSpeech + AISHELL3 + VCTK + some other TTS datasets (if you want) with the config of CSMSC (24kHz).

LJSpeech is 22.05kHz, but you don't need to resample it yourself, because the preprocess stage resamples the wavs to the sample rate set in the config file.
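
To make the 22.05kHz to 24kHz step concrete, here is a minimal sketch of what that resampling means for clip length. It is only an illustration: `resampled_length` and `resample_linear` are hypothetical helper names, and the linear interpolation is a toy stand-in, not the resampler the preprocess stage actually uses (real pipelines normally use a band-limited filter).

```python
from fractions import Fraction
import math

SRC_SR = 22050  # LJSpeech sample rate (from the issue text)
DST_SR = 24000  # CSMSC config sample rate (from the issue text)


def resampled_length(n_samples, src_sr=SRC_SR, dst_sr=DST_SR):
    """Number of samples after resampling, rounded up (hypothetical helper)."""
    return math.ceil(n_samples * Fraction(dst_sr, src_sr))


def resample_linear(samples, src_sr=SRC_SR, dst_sr=DST_SR):
    """Toy linear-interpolation resampler, for illustration only."""
    n_out = resampled_length(len(samples), src_sr, dst_sr)
    step = Fraction(src_sr, dst_sr)  # source samples advanced per output sample
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * step               # exact position in the source signal
        lo = min(int(pos), last)     # left neighbour
        hi = min(lo + 1, last)       # right neighbour, clamped at the end
        frac = float(pos - int(pos)) # interpolation weight toward `hi`
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


# The ratio 24000/22050 reduces to 160/147, so every 147 input samples
# become 160 output samples.
print(Fraction(DST_SR, SRC_SR))
print(resampled_length(22050))
```

So one second of LJSpeech audio (22050 samples) ends up as 24000 samples, and mel features extracted afterwards follow the 24kHz settings of the CSMSC config.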
