Code for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"
BSD-3-CLAUSE License
Gluon CV Toolkit
PyTorch implementation of Compressive Transformers, from DeepMind
DALL·E Mini - Generate images from a text prompt
Implementation of Graph Transformer in PyTorch, for potential use in replicating AlphaFold2
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Implementation of Parti, Google's pure attention-based text-to-image neural network, in PyTorch
Keras implementation of Transformers for humans
Meta-Transformer for Unified Multimodal Learning
🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tune...
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
Code for the EMNLP 2022 paper "Better Few-Shot Relation Extraction with Label Prompt Dropout"
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities...
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | Based on the CPM base...