Cuda Ecosystem

Community Repos
5.3K
Experts
549
Created by: NVIDIA
Released: June 23, 2007

https://github.com/Bruce-Lee-LY/decoding_attention

Decoding Attention is specially optimized for multi head attention (MHA) using CUDA core for the decoding stage of LLM inference

14 Aug 2024 14

https://github.com/bkraad47/fat_llama

fat_llama is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques

21 Jul 2024 5

Infero

An easy to use, high performant CUDA powered LLM inference library

05 Jun 2024 12

LuisaCompute

High-Performance Rendering Framework on Stream Architectures

20 Nov 2020 636

light-the-torch

Install PyTorch distributions with computation backend auto-detection

09 Jul 2020 218

https://github.com/koide3/fast_gicp

A collection of GICP-based fast point cloud registration algorithms

05 Feb 2020 1,216

computeWorks_examples

Matrix multiplication example performed with OpenMP, OpenACC, BLAS, cuBLABS, and CUDA

16 May 2019 6

nvml_examples

Examples showing how to utilize the NVML library for GPU monitoring

09 May 2019 16

https://github.com/spcl/dace

DaCe - Data Centric Parallel Programming

26 Feb 2019 469

https://github.com/arbor-sim/arbor

The Arbor multi-compartment neural network simulation library

03 Oct 2016 108

TIGRE

TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox

13 Jun 2016 561