autoregressive-linear-attention-cuda

CUDA implementation of autoregressive linear attention, with all the latest research findings

MIT License

Stars

Committers

View Code on GitHub

Ecosystems: Python, Cuda

Commit Statistics

Past Year

All Time

Total Commits

Total Committers

Avg. Commits Per Committer

0.0

4.0

Bot Commits

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues

N/A

Related Projects

decoding_attention

Decoding Attention is specially optimized for multi head attention (MHA) using CUDA core for the ...

14 Aug 2024 14

alpaka

Abstraction Library for Parallel Kernel Acceleration

05 Nov 2014 303

DMHead

Dual model head pose estimation. Fusion of SOTA models. 360° 6D HeadPose detection. All pre-proce...

19 Jun 2022 65

PyDyNet

NumPy实现类PyTorch的动态计算图和神经网络框架(MLP, CNN, RNN, Transformer)

06 May 2022 71

gaussian-splatting-cuda

3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!

30 Jul 2023 862

kaolin

A PyTorch Library for Accelerating 3D Deep Learning Research

14 Nov 2019 4,262

marian-dev

Fast Neural Machine Translation in C++ - development repository

03 May 2016 255

ScaleLLM

A high-performance inference system for large language models, designed for production environments.

24 Jul 2023 289

efficient-dl-systems

Efficient Deep Learning Systems course materials (HSE, YSDA)

06 Dec 2021 651

thundergbm

ThunderGBM: Fast GBDTs and Random Forests on GPUs

11 Nov 2016 691

cccl

CUDA C++ Core Libraries

17 Sep 2020 743

CUDA-Learn-Notes

🎉 Modern CUDA Learn Notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, ...

17 Dec 2022 1,308

neoheartbeats-kernel

An architecture for LLMs' continual-learning and long-term memories

26 Jul 2024 4

kernel_tuner

Kernel Tuner

28 Mar 2016 248

20220228_intel_deeplearning_day_hitnet_demo

Special Presentation Demo at Intel IoT Planet 2021 DeepLearning Day / インテル IoT プラネット 2021 DeepLea...

11 Feb 2022 19