diart

A python package to build AI-powered real-time audio applications

MIT License

Downloads

5.1K

Stars

830

View Code on GitHub Visit Website

Ecosystems: Python

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues

N/A

3 months

Package Rankings

Top 9.51% on Pypi.org

Related Projects

auxiva-ipa

Fast algorithm for determined blind source separation with update of demixing filters with joint ...

23 Aug 2020 28

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

26 Mar 2024 2,507

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system ...

17 Oct 2023 1,255

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...

15 Nov 2023 4,482

Whisper-transcription_and_diarization-speaker-identification-

How to use OpenAIs Whisper to transcribe and diarize audio files

12 Oct 2022 285

voxceleb_trainer

In defence of metric learning for speaker recognition

26 Mar 2020 1,031

parler-tts

Inference and training library for high-quality TTS models.

13 Feb 2024 2,623

ASR_benchmark

Program to benchmark various speech recognition APIs

11 Dec 2017 78

SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's spe...

18 Jan 2019 507

diffusion-separation

Single channel speech source separation by diffusion process (ICASSP 2023)

10 Mar 2023 83

AudioLDM2

Text-to-Audio/Music Generation

04 Aug 2023 2,248

ChatTTS

A generative speech model for daily dialogue.

27 May 2024 31,328

whisper-plus

WhisperPlus: Advancing Speech-to-Text Processing 🚀

21 Nov 2023 1,318

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

29 Jan 2023 2,400

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

13 Jan 2023 1,898