GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
APACHE-2.0 License
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Adapting LLaMA Decoder to Vision Transformer
An open platform for training, serving, and evaluating large language models. Release repo for Vi...
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantiz...
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
The PyTorch Implementation based on YOLOv4 of the paper: "Complex-YOLO: Real-time 3D Object Detec...
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregress...
Official Pytorch Implementation of "OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection...
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
GLM (General Language Model)
minichatgpt - To Train ChatGPT In 5 Minutes
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)