Implementation of Unsupervised Audio Spectrogram Compression using Vector Quantized Autoencoders
Core Engine of Singing Voice Conversion & Singing Voice Clone
Audio super resolution using neural networks
A Python Tool for Analysis of Mouse Vocal Communication
A PyTorch library and evaluation platform for end-to-end compression research
Audio classification implemented with PaddlePaddle, supporting models such as EcapaTdnn, PANNS, TDNN, Res2Net, and ResNetSE, along with multiple preprocessing methods
Variational auto-encoders for audio
A screaming vocal samples dataset.
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 ...
WaveGAN: Learn to synthesize raw audio with generative adversarial networks
🔈 ⃕ 🖼
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Speech emotion recognition implemented in PyTorch
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
A JAX Implementation of the Descript Audio Codec