An unofficial cuda assembler, for all generations of SASS, hopefully :)
MIT License
Rust bindings to the NVIDIA NVBIT binary instrumentation API
Safe rust wrapper around CUDA toolkit
High-Performance Rendering Framework on Stream Architectures
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofl...
Real-time large scale dense visual SLAM system
Agenium Scale vectorization library for CPUs and GPUs
Fortran interfaces for ROCm libraries
BQN virtual machine
A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface
Weighted MinHash implementation on CUDA (multi-gpu).
Some CUDA design patterns and a bit of template magic for CUDA
💣 SMH – a computer vision project for automatic, precision mortar strike calculations in Squad
CUDA C++ Core Libraries
Achieve peak performance on x86 CPUs and NVIDIA GPUs
Haskell FFI bindings to CUDA