A PyTorch Library for Accelerating 3D Deep Learning Research
VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications
🎉 Modern CUDA learning notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm
A brian2 extension to simulate spiking neural networks on GPUs
SDK for GPU accelerated genome assembly and analysis
A collection of GICP-based fast point cloud registration algorithms
A highly optimised C++ library for mathematical applications and neural networks
Canny edge detector implemented in CUDA C/C++
(2024/2025) A library and environment for parallel processing in a power-limited CPU+GPU cluster environment
Decoding Attention is specially optimized for multi-head attention (MHA) using CUDA cores for the decoding stage of LLM inference
Best practices & guides on how to write distributed PyTorch training code
An architecture for continual learning and long-term memory in LLMs
Templated C++/CUDA implementation of Model Predictive Path Integral Control (MPPI)
fat_llama is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques
A low-footprint, GPU-accelerated speech-to-text Python package for the JetPack 5 era, bolstered by an optimized graph
A Discord bot running on llama.cpp with the Llama 3 model and image generation