:link: Some useful websites for programmers.
倉庫
mpariente 的倉庫
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning"
An open source dataset for source separation
First repository
Ranger - a synergistic optimizer using RAdam (Rectified Adam) and LookAhead in one codebase
A Pure-Python Real-Time Audio Library
A must-read paper for speech separation based on neural networks
A Flexible and Powerful Parameter Server for large-scale machine learning
simple audio I/O for pytorch
Build the Linux Kernel and Modules on board the NVIDIA Jetson Nano Developer Kit
https://github.com/DataTalksClub
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
🤗 Fast, efficient, open-access datasets and evaluation metrics in PyTorch, TensorFlow, NumPy and Pandas
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
End-to-End Speech Processing Toolkit
We'll see what this becomes
Header-only library for using Keras models in C++.
This is the code for the "How to Deploy a Keras Model to Production" by Siraj Raval on Youtube