Deep Semi-Supervised Learning with Holistic methods for audio classification.
PyTorch implementation of convolutional neural network-based text-to-speech synthesis models
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
An Open Source text-to-speech system built by inverting Whisper.
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV...
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
Foundational model for human-like, expressive TTS
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
A PyTorch-based Speech Toolkit
Kaggle | 1st place solution for Freesound Audio Tagging 2019
End-to-End Speech Processing Toolkit
The training code for the 4th place model at MDX 2021 leaderboard A.
Official Implementation of Mockingjay in PyTorch
solo-learn: a library of self-supervised methods for visual representation learning powered by Py...
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...