speaker-disentangled speech linguistic content quantizer
仓库
Plachtaa 的仓库
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Training code for FAcodec presented in NaturalSpeech3
[ICASSP'26] Real-time streaming voice anonymization & voice conversion
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
zero-shot voice conversion & singing voice conversion, with real-time support