ML images for hashistack.org - Machine Learning Models REST through MLflow
APACHE-2.0 License
How to use OpenAI's Whisper to transcribe and diarize audio files
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system ...
Open-source subtitle generation for seamless content translation.
A generative speech model for daily dialogue.
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single ...
A desktop application that transcribes audio from files, microphone input or YouTube videos with ...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Using LLMs and pre-trained caption models for super-human performance on image captioning.
Explore the power of the Gemma model with GemGPT, a project leveraging AI for innovative solutions. J...
🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Tool...
Empowering individuals to agnostically run machine learning algorithms to produce ad-hoc AI features
OpenChat: Advancing Open-source Language Models with Imperfect Data
WhisperPlus: Advancing Speech-to-Text Processing 🚀