🚀 AI voice cloning: clone your voice within 5 seconds and generate arbitrary speech content in real-time
PyTorch implementation of Tacotron, an end-to-end generative text-to-speech (TTS) model.
A Web UI for easy subtitle generation using the Whisper model.
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Two neural networks connected by a single module: one for recognition, the other for voice generation.
An Open Source text-to-speech system built by inverting Whisper.
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
A collection of Audio and Speech pre-trained models.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Unofficial PyTorch implementation of Google AI's VoiceFilter system
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Official implementation of Mockingjay in PyTorch