Some CUDA design patterns and a bit of template magic for CUDA
Large scale K-means and K-nn implementation on NVIDIA GPU / CUDA
Classes enabling finmath-lib to run its Monte-Carlo models on Cuda GPUs
VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writ...
Weighted MinHash implementation on CUDA (multi-gpu).
BQN virtual machine
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofl...
CLTune: An automatic OpenCL & CUDA kernel tuner
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mi...
Achieve peak performance on x86 CPUs and NVIDIA GPUs
Rust bindings to the NVIDIA NVBIT binary instrumentation API
CUDA C++ Core Libraries
A library to create kaleidoscope effect on images with CUDA. You can build on all platforms using...
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being p...
A small utility for getting some info post-hoc about a program's run.