A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.
MIT License
Audiocraft is a library for audio processing and generation with deep learning. It features the s...
Data manipulation and transformation for audio signal processing, powered by PyTorch
Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TT...
Unsupervised Language Modeling at scale for robust sentiment classification
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 ...
Inference and training library for high-quality TTS models.
Home of StarCoder2!
Easily turn large sets of audio urls to an audio dataset.
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
AeCC: Autoencoders for Compressed Communication
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including ...
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Contrastive Language-Audio Pretraining