CUDA implementation of autoregressive linear attention, with all the latest research findings
Implementation of Agent Attention in Pytorch
Implementation of Block Recurrent Transformer - Pytorch
Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning
Some personal experiments around routing tokens to different autoregressive attention mechanisms, akin to mi...
Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT
Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attenti...
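The memory-efficient idea above can be sketched independently of the repo: process keys in chunks with a running max and running sum (online softmax), so the full attention matrix is never materialized. This is a minimal NumPy illustration, not the repo's API; shapes and the chunk size are arbitrary.

```python
import numpy as np

def chunked_attention(q, k, v, chunk=3):
    # Streams over key/value chunks, keeping a running logit max (m),
    # running exp-sum (l), and running weighted-value accumulator (acc),
    # so the (seq_q, seq_k) score matrix is never built in full.
    sq, d = q.shape
    m = np.full((sq, 1), -np.inf)      # running max of logits per query
    l = np.zeros((sq, 1))              # running sum of exp(logit - m)
    acc = np.zeros((sq, v.shape[1]))   # running sum of weights @ values
    for start in range(0, k.shape[0], chunk):
        s = q @ k[start:start + chunk].T / np.sqrt(d)
        m_new = np.maximum(m, s.max(axis=-1, keepdims=True))
        p = np.exp(s - m_new)
        scale = np.exp(m - m_new)      # rescale old stats to the new max
        l = l * scale + p.sum(axis=-1, keepdims=True)
        acc = acc * scale + p @ v[start:start + chunk]
        m = m_new
    return acc / l

rng = np.random.default_rng(0)
q, k, v = (rng.normal(size=(6, 8)) for _ in range(3))
out = chunked_attention(q, k, v, chunk=3)
```

Because the running statistics are rescaled whenever the max changes, the result matches ordinary softmax attention exactly, only the peak memory differs.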
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architectu...
A Transformer made of Rotation-equivariant Attention using Vector Neurons
Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deforma...
Explorations into the recently proposed Taylor Series Linear Attention
Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Fun...
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks...
An implementation of local windowed attention for language modeling
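The local-window idea is simple enough to show directly: each query attends only to keys within the last `window` positions (causally). A minimal NumPy sketch, not this repo's interface; the window size and tensor shapes are illustrative.

```python
import numpy as np

def local_causal_attention(q, k, v, window=4):
    # q, k, v: (seq, dim). Position i attends to keys j with
    # i - window + 1 <= j <= i, i.e. a causal sliding window.
    seq, dim = q.shape
    scores = q @ k.T / np.sqrt(dim)
    i = np.arange(seq)[:, None]
    j = np.arange(seq)[None, :]
    outside = (j > i) | (j < i - window + 1)   # mask out non-local / future keys
    scores = np.where(outside, -np.inf, scores)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

rng = np.random.default_rng(0)
q, k, v = (rng.normal(size=(8, 16)) for _ in range(3))
out = local_causal_attention(q, k, v, window=4)
```

The first position can only attend to itself, so its output is exactly its own value vector; in practice the windowing is done by reshaping into blocks rather than masking a full score matrix, but the result is the same.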
An implementation of Performer, a linear attention-based transformer, in Pytorch
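Performer replaces softmax with a kernel feature map so attention factorizes and runs in linear time. The sketch below uses the simpler elu(x)+1 feature map (Katharopoulos et al. style) rather than Performer's random features, purely to show the causal cumulative-sum trick; it is not this repo's code.

```python
import numpy as np

def causal_linear_attention(q, k, v):
    # phi keeps scores positive; with phi(q_i) . phi(k_j) replacing
    # exp(q_i . k_j), causal attention becomes two running sums.
    phi = lambda x: np.where(x > 0, x + 1.0, np.exp(x))
    q, k = phi(q), phi(k)
    # kv[t] = sum_{s<=t} phi(k_s) v_s^T ; z[t] = sum_{s<=t} phi(k_s)
    kv = np.cumsum(k[:, :, None] * v[:, None, :], axis=0)  # (seq, dk, dv)
    z = np.cumsum(k, axis=0)                               # (seq, dk)
    num = np.einsum('sd,sde->se', q, kv)
    den = np.einsum('sd,sd->s', q, z)[:, None]
    return num / den

rng = np.random.default_rng(0)
q, k, v = (rng.normal(size=(6, 8)) for _ in range(3))
out = causal_linear_attention(q, k, v)
```

Because the per-step state (`kv`, `z`) has fixed size, the same recurrence lets an autoregressive decoder run in O(1) memory per generated token instead of re-attending over the whole prefix.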