CUDA Ecosystem

Community Repos
5.3K
Experts
549
Created by: NVIDIA
Released: June 23, 2007
Popular Projects 

cudarc

Safe Rust wrapper around the CUDA toolkit

16 Sep 2022 597

https://github.com/lebedov/scikit-cuda

Python interface to GPU-powered libraries

27 Sep 2010 986

https://github.com/spcl/dace

DaCe - Data Centric Parallel Programming

26 Feb 2019 469

https://github.com/m4rs-mt/ILGPU

ILGPU JIT Compiler for high-performance

08 Jan 2017 1,343

https://github.com/NVIDIAGameWorks/kaolin

A PyTorch Library for Accelerating 3D Deep Learning Research

14 Nov 2019 4,262

cccl

CUDA C++ Core Libraries

17 Sep 2020 743

https://github.com/jgbit/vuda

VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications

06 Oct 2018 852

https://github.com/gunrock/gunrock

Programmable CUDA/C++ GPU Graph Analytics

03 Nov 2013 965

https://github.com/sony/nnabla-ext-cuda

A CUDA Extension of Neural Network Libraries

21 Jun 2017 92

https://github.com/DefTruth/CUDA-Learn-Notes

🎉 Modern CUDA Learn Notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm

17 Dec 2022 1,308

https://github.com/brian-team/brian2cuda

A brian2 extension to simulate spiking neural networks on GPUs

27 Oct 2015 58

cuda-toolkit

GitHub Action to install CUDA

10 Mar 2021 146

kernel_tuner

Kernel Tuner

28 Mar 2016 248

https://github.com/clara-parabricks/GenomeWorks

SDK for GPU-accelerated genome assembly and analysis

31 May 2019 284

https://github.com/koide3/fast_gicp

A collection of GICP-based fast point cloud registration algorithms

05 Feb 2020 1,216

https://github.com/mp3guy/ElasticFusion

Real-time dense visual SLAM system

22 Oct 2015 1,772

marian-dev

Fast Neural Machine Translation in C++ - development repository

03 May 2016 255

librapid

A highly optimised C++ library for mathematical applications and neural networks

25 May 2021 163

xmrig-cuda

NVIDIA CUDA plugin for XMRig miner

28 Oct 2019 368

LuisaCompute

High-Performance Rendering Framework on Stream Architectures

20 Nov 2020 636
New Projects 

https://github.com/akhuntsaria/canny-edge-detection

Canny edge detector implemented in CUDA C/C++

10 Oct 2024 2

https://github.com/Qervas/cn_chess_ai

Chinese chess (Xiangqi) AI

18 Sep 2024 1

https://github.com/MAJ0RRR/parallel-processing-cpu-and-gpu-env-and-lib-with-powercap

(2024/2025) A library and environment for parallel processing in a power-limited CPU+GPU cluster environment

16 Aug 2024 2

https://github.com/Bruce-Lee-LY/decoding_attention

Decoding Attention is specially optimized for multi-head attention (MHA), using CUDA cores for the decoding stage of LLM inference

14 Aug 2024 14

https://github.com/SamuraiBUPT/CUDA_Code

Code for learning CUDA

11 Aug 2024 2

https://github.com/dancing-ui/uestc_vhm

Object tracking using yolov8, fast-reid, and deepsort

10 Aug 2024 6

https://github.com/LambdaLabsML/distributed-training-guide

Best practices & guides on how to write distributed PyTorch training code

31 Jul 2024 190

https://github.com/ZephirFXEC/HNanoSolver

Houdini GPU Fluid Solver powered by NanoVDB

29 Jul 2024 7

https://github.com/Kentakoong/mtnlog

A simple multinode performance logger for Python

29 Jul 2024 0

https://github.com/neoheartbeats/neoheartbeats-kernel

An architecture for LLMs' continual-learning and long-term memories

26 Jul 2024 4

https://github.com/ACDSLab/MPPI-Generic

Templated C++/CUDA implementation of Model Predictive Path Integral Control (MPPI)

24 Jul 2024 19

https://github.com/bkraad47/fat_llama

fat_llama is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques

21 Jul 2024 5

PyAV-CUDA

Extension of PyAV (ffmpeg bindings) with hardware decoding support

15 Jul 2024 0

tinyGPUlang

Tutorial on building a GPU compiler backend in LLVM

14 Jul 2024 7

whisper-onnx-python

A low-footprint, GPU-accelerated speech-to-text Python package for the JetPack 5 era, bolstered by an optimized graph

24 Jun 2024 1

Infero

An easy-to-use, high-performance, CUDA-powered LLM inference library

05 Jun 2024 12

voice-gulliver

The best Gradio web UI for AI subtitling, translation, and dubbing

05 Jun 2024 1

vacancies_server

This is a server for vacancies generation using LLM (Saiga3)

03 May 2024 1

Sparky-2

This is a Discord bot running on llama.cpp with the Llama 3 model and image generation

27 Apr 2024 5

KuiperLLama

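A good project for campus recruitment (autumn and spring hiring) and internships: build, hands-on and from scratch, a large-model inference framework that supports LLama.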

25 Apr 2024 191