Code for the paper "Jukebox: A Generative Model for Music"
OTHER License
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (...
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Python library for automatic training, optimization and comparison of Transformer models on most ...
Foundational model for human-like, expressive TTS
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch