An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
APACHE-2.0 License
Statistics for this project are still being loaded, please check back later.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in h...
Wav2Vec 2.0 catalan training scripts and models
WhisperPlus: Advancing Speech-to-Text Processing 🚀
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, w...
Simple Python library, distributed via binary wheels with few direct dependencies, for easily usi...
OneShot Learning-based hotword detection.
Inference and training library for high-quality TTS models.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's...
DeepMind's Tacotron-2 Tensorflow implementation
AudioLDM: Generate speech, sound effects, music and beyond, with text.
A TensorFlow implementation of DeepMind's WaveNet paper