Это две нейросети соединённые одним модулем. Одна для распознавания, другая для генерация голоса.
MIT License
Multilingual Voice Understanding Model
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
WaveRNN Vocoder + TTS
Foundational model for human-like, expressive TTS
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Unofficial PyTorch implementation of Google AI's VoiceFilter system
A collection of Audio and Speech pre-trained models.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine