PaddlePaddle/PaddleSpeech

[TTS] Try to train a universal GAN Vocoder using CSMSC + LJSpeech + AISHELL3 + VCTK

Open

#2803 opened on Jan 6, 2023

Labels: T2S, feature request, good first issue

Description

A universal GAN vocoder may work well for the acoustic models (AMs) of all the different datasets. For example, CSMSC is a single-speaker female dataset, so a vocoder trained only on it may generate bad wavs from the mels of male speakers, because different genders have different distributions of speech features.

Please try to train a universal GAN vocoder using CSMSC + LJSpeech + AISHELL3 + VCTK, plus any other TTS datasets you want, with the CSMSC config (24kHz).

LJSpeech is 22.05kHz, but you don't need to resample it yourself, because the preprocess stage resamples the wavs to the sample rate set in the config file.
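To make the 22.05kHz to 24kHz step concrete, here is a minimal sketch of what resampling involves. It is not the PaddleSpeech preprocess code: `resample_ratio` and `resample_linear` are hypothetical helper names, and the naive linear interpolation (with no anti-aliasing filter) stands in for whatever resampler the preprocess stage actually uses.

```python
from fractions import Fraction


def resample_ratio(src_sr, dst_sr):
    """Return the reduced (up, down) integer factors for going from src_sr to dst_sr."""
    ratio = Fraction(dst_sr, src_sr)
    return ratio.numerator, ratio.denominator


def resample_linear(samples, src_sr, dst_sr):
    """Naive linear-interpolation resampler (illustration only, no anti-aliasing)."""
    if src_sr == dst_sr or not samples:
        return list(samples)
    # Number of output samples covering the same duration as the input.
    n_out = len(samples) * dst_sr // src_sr
    out = []
    for i in range(n_out):
        # Position of output sample i on the input time axis.
        pos = i * src_sr / dst_sr
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out


# One second of silence at the LJSpeech rate, converted to the CSMSC config rate.
one_second = [0.0] * 22050
converted = resample_linear(one_second, 22050, 24000)
print(resample_ratio(22050, 24000), len(converted))
```

The ratio reduces to 160/147, so one second of LJSpeech audio (22050 samples) becomes 24000 samples at the target rate. Mel features extracted afterwards then share the same sample rate across all datasets, which is why a single config can be used for the mixed training set.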
