LongRoPE is a novel method that extends the context window of pre-trained LLMs to 2048k tokens.
Large-scale pretraining for dialogue
This repository contains resources for accessing the official benchmarks, code, and checkpoints ...
Dedicated to building industrial foundation models for universal data intelligence across industr...
ProbTS is a benchmarking toolkit for time series forecasting.
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and la...
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Grounded Language-Image Pre-training
General technology for enabling AI capabilities w/ LLMs and MLLMs
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
An efficient implementation of the popular sequence models for text generation, summarization, an...
Speeds up long-context LLM inference by computing attention with approximate, dynamic sparse methods, ...
The implementation of DeBERTa
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabil...
CodeBERT
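The loralib entry above implements "LoRA: Low-Rank Adaptation of Large Language Models". A minimal NumPy sketch of the core idea, assuming the standard formulation from the paper (frozen weight `W` plus a trainable low-rank update `B @ A` scaled by `alpha / r`; all names here are illustrative, not loralib's API):

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, r, alpha = 16, 16, 4, 8

W = rng.standard_normal((d_out, d_in))      # frozen pretrained weight (not trained)
A = rng.standard_normal((r, d_in)) * 0.01   # trainable low-rank factor
B = np.zeros((d_out, r))                    # zero-initialized so the update starts at 0

def lora_forward(x):
    # base path plus scaled low-rank update, as in the LoRA paper
    return x @ W.T + (x @ A.T @ B.T) * (alpha / r)

x = rng.standard_normal((2, d_in))
# with B = 0, the LoRA layer reproduces the frozen base layer exactly
assert np.allclose(lora_forward(x), x @ W.T)
# only A and B are trained: 2 * 16 * 4 = 128 parameters vs 256 in W
assert A.size + B.size < W.size
```

The payoff is the parameter count: for rank `r` much smaller than the layer dimensions, the trainable update `B @ A` has far fewer parameters than the full weight matrix.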