Unofficial PyTorch implementation of Google AI's VoiceFilter system
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
An unofficial PyTorch implementation of the audio LM VALL-E
so-vits-svc fork with realtime support, improved interface and more features.
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE...
Neural building blocks for speaker diarization: speech activity detection, speaker change detecti...
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Data manipulation and transformation for audio signal processing, powered by PyTorch
An open-source text-to-speech system built by inverting Whisper.
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
GUI for a Vocal Remover that uses Deep Neural Networks.
Foundational model for human-like, expressive TTS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)