Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
AGPL-3.0 License
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package depende...
AudioLDM: Generate speech, sound effects, music and beyond, with text.
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 ...
A TensorFlow implementation of DeepMind's WaveNet paper
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supported including ...
A generative speech model for daily dialogue.
Multi-lingual large voice generation model, providing inference, training and deployment full-sta...
Wav2Lip UHQ extension for Automatic1111
Data manipulation and transformation for audio signal processing, powered by PyTorch
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
Even 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).
An extremely simple tool for separating vocals and background music, completely localized for web...
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
A collection of Audio and Speech pre-trained models.