Repositories by w-okada

Source code of APNet2, a vocoder

Last commit 23 Nov 2023

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

Last commit 18 Mar 2019

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Last commit 16 Oct 2023

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Last commit 13 Jun 2023

 (0 stars) (1 fork) (0 indexed issues) (0 open good first issues)

OneShot Learning-based hotword detection.

Last commit 12 Sep 2024

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Last commit 19 Sep 2024

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Last commit 30 Apr 2026

 (3 stars) (1 fork) (0 indexed issues) (0 open good first issues)

Last commit 8 Nov 2023

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.

Last commit 20 May 2025

 (1 star) (1 fork) (0 indexed issues) (0 open good first issues)

Real-time voice changer using AI (client)

Last commit 10 Feb 2023

 (2 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Real-time voice changer using AI (Trainer)

Last commit 9 Dec 2022

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

Last commit 10 Aug 2023

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Sample that displays a window captured with the WinRT GraphicsCaptureAPI as a virtual camera

Last commit 17 Feb 2022

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8), EdgeTPU, CoreML.

Last commit 20 Dec 2022

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Last commit 19 Aug 2024

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server

Last commit 1 Nov 2023

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

Multilingual Voice Understanding Model

Last commit 2 Sep 2024

 (2 stars) (0 forks) (0 indexed issues) (0 open good first issues)

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Last commit 7 Mar 2024

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

inverse kinematics for three.js

Last commit 9 May 2022

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)