Signal processing and ML
APACHE-2.0 License
A desktop application that transcribes audio from files, microphone input or YouTube videos with ...
[ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning fr...
Modular Single-file Reinforcement Learning Algorithms Library
[CVPR 2021] Code for "Augmentation Strategies for Learning with Noisy Labels".
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Offic...
A curated list of useful Python packages for data geeks
Train vision models using JAX and 🤗 transformers
Open dubbing is an AI dubbing system which uses machine learning models to automatically translat...
Implementations of computer vision algorithms
Generate code from DSDL using PyDSDL and Jinja2
A module to compute textual lexical richness (aka lexical diversity).
Vision-Augmented Retrieval and Generation (VARAG)
A Python project that trains a deep neural network to distinguish between music symbols
Generic image compressor for machine learning. PyTorch code for our paper "Lossy compression for ...
Image-based ecological information system