Convert text and audio to facial expressions
MIT License
Human-detection-and-Tracking
Control your media player with your face. Dataset creation, training, model selection and inference.
http://www.facegood.cc
Fine-tuned LLaMa2 13B model designed for ReAct-style and Tree-Of-Thoughts style prompting.
Easiest way of fine-tuning HuggingFace video classification models
WhisperPlus: Advancing Speech-to-Text Processing 🚀
Multi-layer Recurrent Neural Networks (LSTM, RNN) for character-level language models in Python u...
Batch Face Detection and Alignment for Modern Research
Embed arbitrary modalities (images, audio, documents, etc) into large language models.
code for training the models from the paper "Learning Individual Styles of Conversational Gestures"
Reverse engineer your pytorch vision models, in style
How to use OpenAIs Whisper to transcribe and diarize audio files
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Genera...
本项目基于PaddleDetection目标检测开发套件,选取1.3M超轻量PPYOLO tiny进行项目开发,并部署于windows端。