kyutai-labs/moshiPython
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
(10,347 stars) (966 forks) (0 indexed issues) (0 open good first issues)
Repositories
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.