Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Apache-2.0 license
Some preliminary explorations of Mamba's context scaling.
[EMNLP 2024] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Mixture-of-Experts for Large Vision-Language Models
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
⚡LLM Zoo is a project that provides data, models, and evaluation benchmarks for large language models.
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
YaRN: Efficient Context Window Extension of Large Language Models
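The YaRN entry above concerns extending the context window of RoPE-based models beyond their trained length. As a rough illustration only, the sketch below shows plain linear position interpolation, the baseline idea that YaRN refines with per-dimension interpolation ramps and an attention temperature; the function names and the `scale` parameter are illustrative, not YaRN's actual API.

```python
# A minimal sketch of RoPE position interpolation, the baseline that YaRN builds on.
# This is NOT YaRN itself: YaRN additionally blends interpolated and original
# frequencies per dimension and rescales attention logits. Names here are made up.
import numpy as np

def rope_angles(position, dim, base=10000.0, scale=1.0):
    """Rotary angles for one position; scale > 1 squeezes positions back into the trained range."""
    inv_freq = base ** (-np.arange(0, dim, 2) / dim)   # per-pair rotation frequencies
    return (position / scale) * inv_freq               # linear position interpolation

def apply_rope(x, position, base=10000.0, scale=1.0):
    """Apply rotary embedding to a 1-D vector x of even dimension at the given position."""
    pairs = x.reshape(-1, 2)
    theta = rope_angles(position, x.shape[0], base, scale)
    cos, sin = np.cos(theta), np.sin(theta)
    x1, x2 = pairs[:, 0], pairs[:, 1]
    rotated = np.stack([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=-1)
    return rotated.reshape(-1)
```

With `scale` set to the ratio of the target context length to the original one, positions beyond the trained range map back into it, which is the starting point the YaRN paper improves on.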
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Fast and memory-efficient exact attention
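The entry above refers to exact attention computed without materializing the full score matrix. The sketch below is a plain NumPy illustration of the underlying blockwise online-softmax idea for a single query, not the library's fused CUDA kernels or API; `blockwise_attention` and its arguments are assumptions made for the example.

```python
# A minimal sketch of the memory-efficient exact-attention idea: process keys/values
# in blocks and maintain a running (online) softmax, so the N x N score matrix is
# never materialized. Illustrative only; not FlashAttention's implementation.
import numpy as np

def blockwise_attention(q, K, V, block_size=128):
    """Exact attention for one query q (d,) against K (n, d) and V (n, d_v), computed blockwise."""
    scale = 1.0 / np.sqrt(q.shape[0])
    m = -np.inf                      # running max of scores seen so far
    l = 0.0                          # running sum of exp(score - m)
    acc = np.zeros(V.shape[1])       # running weighted sum of values
    for start in range(0, K.shape[0], block_size):
        k_blk = K[start:start + block_size]
        v_blk = V[start:start + block_size]
        s = (k_blk @ q) * scale              # scores for this block
        m_new = max(m, s.max())
        correction = np.exp(m - m_new)       # rescale previous accumulators to the new max
        p = np.exp(s - m_new)
        l = l * correction + p.sum()
        acc = acc * correction + p @ v_blk
        m = m_new
    return acc / l

# Quick check against the naive formulation softmax(qK^T / sqrt(d)) V.
rng = np.random.default_rng(0)
q = rng.normal(size=64)
K = rng.normal(size=(1000, 64))
V = rng.normal(size=(1000, 64))
s = (K @ q) / np.sqrt(64)
naive = (np.exp(s - s.max()) / np.exp(s - s.max()).sum()) @ V
assert np.allclose(blockwise_attention(q, K, V), naive)
```

The result matches the naive computation exactly, while peak memory depends on the block size rather than the sequence length, which is the property that makes this style of attention practical at long context.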
Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining.
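The last entry describes keeping memory constant while generating far beyond the trained length. A common way to do this, as in the attention-sinks / StreamingLLM line of work, is to keep the first few "sink" tokens plus a sliding window of recent tokens in the KV cache and evict everything in between. The sketch below shows only that eviction policy under assumed names (`SinkKVCache`, `num_sink_tokens`, `window_size`); it is not the repository's actual implementation.

```python
# A minimal sketch of a constant-memory KV-cache policy with attention sinks:
# retain the first few tokens plus a sliding window of recent tokens.
# Illustrative only; names and sizes are assumptions, not the library's API.
from collections import deque

class SinkKVCache:
    def __init__(self, num_sink_tokens=4, window_size=1020):
        self.num_sink = num_sink_tokens
        self.sinks = []                           # first few (key, value) pairs, never evicted
        self.window = deque(maxlen=window_size)   # recent (key, value) pairs, oldest evicted first

    def append(self, key, value):
        if len(self.sinks) < self.num_sink:
            self.sinks.append((key, value))
        else:
            self.window.append((key, value))      # deque drops the oldest entry automatically

    def kv(self):
        # Cache size is bounded by num_sink_tokens + window_size regardless of sequence length.
        return self.sinks + list(self.window)
```

Because the cache never grows past `num_sink_tokens + window_size` entries, memory stays constant no matter how long generation runs, which is the property the entry above advertises.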