speech-to-speech

Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"

Stars

View Code on GitHub Visit Website View on X

Ecosystems: Python

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues

N/A

Related Projects

parler-tts

Inference and training library for high-quality TTS models.

13 Feb 2024 2,623

ZS-TTS-Evaluation

28 Jan 2024 30

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

13 Jan 2023 1,898

Automatic-Lipreading-translator

05 Apr 2024 2

ZeroSpeech-TTS-without-T

A Pytorch implementation for the ZeroSpeech 2019 challenge.

15 Feb 2019 110

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

19 Apr 2023 1,269

svoice

We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multi...

19 Nov 2020 1,227

Whisper-WebUI

A Web UI for easy subtitle using whisper model.

02 Mar 2023 1,083

whisper-plus

WhisperPlus: Advancing Speech-to-Text Processing 🚀

21 Nov 2023 1,318

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...

15 Nov 2023 4,482

AudioLDM2

Text-to-Audio/Music Generation

04 Aug 2023 2,248