Repository

Repository di yl4579

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Ultimo commit 16 giu 2022

 (125 star) (48 fork) (0 issue indicizzate) (0 good first issue aperte)

Ultimo commit 22 lug 2025

 (302 star) (40 fork) (0 issue indicizzate) (0 good first issue aperte)

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Ultimo commit 14 gen 2025

 (253 star) (23 fork) (0 issue indicizzate) (0 good first issue aperte)

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Ultimo commit 13 gen 2025

 (269 star) (55 fork) (0 issue indicizzate) (0 good first issue aperte)
yl4579/ParallelWaveGANJupyter Notebook

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Ultimo commit 26 ago 2021

 (0 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Ultimo commit 22 ago 2022

 (151 star) (34 fork) (0 issue indicizzate) (0 good first issue aperte)

SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs

Ultimo commit 19 lug 2023

 (16 star) (1 fork) (0 issue indicizzate) (0 good first issue aperte)

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Ultimo commit 13 gen 2025

 (521 star) (111 fork) (0 issue indicizzate) (0 good first issue aperte)

Official Implementation of StyleTTS

Ultimo commit 13 gen 2025

 (464 star) (69 fork) (0 issue indicizzate) (0 good first issue aperte)

Official Implementation of StyleTTS-VC

Ultimo commit 14 gen 2025

 (199 star) (29 fork) (0 issue indicizzate) (0 good first issue aperte)

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

Ultimo commit 27 set 2024

 (189 star) (15 fork) (0 issue indicizzate) (0 good first issue aperte)

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Ultimo commit 20 gen 2024

 (3429 star) (210 fork) (7 issue indicizzate) (7 good first issue aperte)

Python libraries for Google Colaboratory

Ultimo commit 24 ago 2021

 (0 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Ultimo commit 2 dic 2020

 (0 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

StarGAN v2 - Official PyTorch Implementation (CVPR 2020)

Ultimo commit 1 lug 2021

 (0 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)