Official PyTorch implementation of BigVGAN (ICLR 2023)
Repositories by lars76
A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.
Optical character recognition for Chinese subtitles using SSD and CNN
Simple config library in C
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
Mandarin Chinese grapheme-to-phoneme converter: converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022)
Hello World program using JSF, Maven, GlassFish, and Java EE.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
k-means clustering with the Intersection over Union (IoU) metric as described in the YOLO9000 paper
Using LLMs to generate a synthetic Chinese-English dictionary
Object localization in images using simple CNNs and Keras
Comprehensive benchmark suite comparing pitch detection algorithms across multiple datasets.
Python C module for creating suffix, LCP and BWT arrays with UTF-8 text.
Code for the paper "Effect of the output activation function on the probabilities and errors in medical image segmentation"
LLM-generated story dataset with quality evaluations across 15 models for training and benchmarking creative writing capabilities.
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks