An STFT/iSTFT for PyTorch.
BSD-3-CLAUSE License
SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/lat...
A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.
Curated list of python software and packages related to scientific research in audio
PyTorch implementation of Tacotron speech synthesis model.
Synthesis of percussion sounds using sinusoidal modelling, DDSP noise synthesis, and a neural sou...
Stable diffusion for real-time music generation
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine lear...
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Data manipulation and transformation for audio signal processing, powered by PyTorch
YAAPT Pitch Tracking function in PyTorch
Benchmark popular audio i/o packages
Ready-to-use Multilingual Text-To-Speech (TTS) package.
spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io
A desktop application that transcribes audio from files, microphone input or YouTube videos with ...
AudioLDM: Generate speech, sound effects, music and beyond, with text.