Repositories

yl4579 repositories

15 supported repositories

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Last commit Jun 16, 2022

 (125 stars) (48 forks) (0 indexed issues) (0 open good first issues)

Last commit Jul 22, 2025

 (302 stars) (40 forks) (0 indexed issues) (0 open good first issues)

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Last commit Jan 14, 2025

 (253 stars) (23 forks) (0 indexed issues) (0 open good first issues)

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Last commit Jan 13, 2025

 (269 stars) (55 forks) (0 indexed issues) (0 open good first issues)

yl4579/ParallelWaveGAN (Jupyter Notebook)

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch

Last commit Aug 26, 2021

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Last commit Aug 22, 2022

 (151 stars) (34 forks) (0 indexed issues) (0 open good first issues)

SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs

Last commit Jul 19, 2023

 (16 stars) (1 fork) (0 indexed issues) (0 open good first issues)

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Last commit Jan 13, 2025

 (521 stars) (111 forks) (0 indexed issues) (0 open good first issues)

Official Implementation of StyleTTS

Last commit Jan 13, 2025

 (464 stars) (69 forks) (0 indexed issues) (0 open good first issues)

Official Implementation of StyleTTS-VC

Last commit Jan 14, 2025

 (199 stars) (29 forks) (0 indexed issues) (0 open good first issues)

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

Last commit Sep 27, 2024

 (189 stars) (15 forks) (0 indexed issues) (0 open good first issues)

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Last commit Jan 20, 2024

 (3,429 stars) (210 forks) (7 indexed issues) (7 open good first issues)

Python libraries for Google Colaboratory

Last commit Aug 24, 2021

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Last commit Dec 2, 2020

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

StarGAN v2 - Official PyTorch Implementation (CVPR 2020)

Last commit Jul 1, 2021

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)