Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis (Meso-DTFA)
MIT License
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks,...
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supported including ...
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Audiocraft is a library for audio processing and generation with deep learning. It features the s...
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Re...
A library for audio and music analysis and feature extraction.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Offic...
Differentiable dynamic range controller in PyTorch.
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Implementation of Unsupervised Audio Spectrogram Compression using Vector Quantized Autoencoders
Curated list of Python software and packages related to scientific research in audio