Repositories

w-okada repositories

99 supported repositories

Source code of APNet2, a vocoder

Last commit Nov 23, 2023

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

Last commit Mar 18, 2019

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Last commit Oct 16, 2023

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Last commit Jun 13, 2023

 (0 stars) (1 fork) (0 indexed issues) (0 open good first issues)

OneShot Learning-based hotword detection.

Last commit Sep 12, 2024

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Last commit Sep 19, 2024

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Last commit Apr 30, 2026

 (3 stars) (1 fork) (0 indexed issues) (0 open good first issues)

Last commit Nov 8, 2023

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.

Last commit May 20, 2025

 (1 star) (1 fork) (0 indexed issues) (0 open good first issues)

AIを使ったリアルタイムボイスチェンジャー(client)

Last commit Feb 10, 2023

 (2 stars) (0 forks) (0 indexed issues) (0 open good first issues)

AIを使ったリアルタイムボイスチェンジャー(Trainer)

Last commit Dec 9, 2022

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

Last commit Aug 10, 2023

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

WinRTのGraphicsCaptureAPIでキャプチャしたウィンドウを仮想カメラとして映すサンプル

Last commit Feb 17, 2022

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8), EdgeTPU, CoreML.

Last commit Dec 20, 2022

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Last commit Aug 19, 2024

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server

Last commit Nov 1, 2023

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

Multilingual Voice Understanding Model

Last commit Sep 2, 2024

 (2 stars) (0 forks) (0 indexed issues) (0 open good first issues)

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Last commit Mar 7, 2024

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

inverse kinematics for three.js

Last commit May 9, 2022

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)