Unofficial PyTorch dataset for Slakh
MIT License
Statistics for this project are still being loaded, please check back later.
A screaming vocal samples dataset.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Genera...
Contrastive Language-Audio Pretraining
DCASE2024 Challenge Task 6 baseline system (Automated Audio Captioning)
Functional, lazy-evaluated dataset manipulation library for ML in Python
AudioLDM: Generate speech, sound effects, music and beyond, with text.
基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法
Data manipulation and transformation for audio signal processing, powered by PyTorch
A Pytorch implementation of Onsets and Frames (Hawthorne 2018)
Easily turn large sets of audio urls to an audio dataset.
Audio super resolution using neural networks