Repository

Repository di Plachtaa

speaker-disentangled speech linguistic content quantizer

Ultimo commit 19 mar 2025

 (25 star) (5 fork) (0 issue indicizzate) (0 good first issue aperte)

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Ultimo commit 25 giu 2024

 (3 star) (1 fork) (0 issue indicizzate) (0 good first issue aperte)

Training code for FAcodec presented in NaturalSpeech3

Ultimo commit 26 ago 2024

 (243 star) (21 fork) (0 issue indicizzate) (0 good first issue aperte)

Ultimo commit 18 set 2024

 (0 star) (2 fork) (0 issue indicizzate) (0 good first issue aperte)

[ICASSP'26] Real-time streaming voice anonymization & voice conversion

Ultimo commit 15 apr 2026

 (75 star) (9 fork) (0 issue indicizzate) (0 good first issue aperte)

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Ultimo commit 3 nov 2023

 (6573 star) (624 fork) (6 issue indicizzate) (6 good first issue aperte)

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Ultimo commit 21 gen 2025

 (5017 star) (730 fork) (0 issue indicizzate) (0 good first issue aperte)

zero-shot voice conversion & singing voice conversion, with real-time support

Ultimo commit 20 apr 2025

 (3777 star) (488 fork) (0 issue indicizzate) (0 good first issue aperte)