Self-Supervised Speech Pre-training and Representation Learning Toolkit
APACHE-2.0 License
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 min...
MiniSora: A community aims to explore the implementation path and future development direction of...
A Pytorch implementation for the ZeroSpeech 2019 challenge.
Aspect Based Sentiment Analysis, PyTorch Implementations. 基于方面的情感分析,使用PyTorch实现。
Official implementation of "Separate Anything You Describe"
A concise but complete implementation of CLIP with various experimental improvements from recent ...
DALL·E Mini - Generate images from a text prompt
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities...
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Re...
Must-read Papers on Textual Adversarial Attack and Defense
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...
Inference and training library for high-quality TTS models.
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable...
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation)...