High-Performance Rendering Framework on Stream Architectures
BSD-3-CLAUSE License
SYCL accelerated BLAKE3 Hash Implementation
Real-time dense visual SLAM system
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofl...
AutoDock for GPUs and other accelerators
BQN virtual machine
Some CUDA design patterns and a bit of template magic for CUDA
Rust bindings to the NVIDIA NVBIT binary instrumentation API
A library to create kaleidoscope effect on images with CUDA. You can build on all platforms using...
Agenium Scale vectorization library for CPUs and GPUs
Real-time large scale dense visual SLAM system
Simple experimental async GPGPU framework for Rust
CUDA C++ Core Libraries
an implementation of parallel linear BVH (LBVH) on GPU
An unofficial cuda assembler, for all generations of SASS, hopefully :)
Large scale K-means and K-nn implementation on NVIDIA GPU / CUDA