Audio super resolution using neural networks
MIT License
Versatile audio super resolution (any -> 48kHz) with AudioSR.
基于Pytorch实现的语音情感识别
Inference and training library for high-quality TTS models.
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Generative Models by Stability AI
基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法
Code and dataset for photorealistic Codec Avatars driven from audio
Single channel speech source separation by diffusion process (ICASSP 2023)
Audio Classification using Image Classification
Keras WaveNet implementation
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Generative models for conditional audio generation
A TensorFlow implementation of DeepMind's WaveNet paper
Zero-Mean Convolutions for Level-Invariant Singing Voice Detection
OneShot Learning-based hotword detection.