An implementation of parallel linear BVH (LBVH) on GPU
MIT License
Some CUDA design patterns and a bit of template magic for CUDA
Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mi...
High-Performance Rendering Framework on Stream Architectures
BQN virtual machine
Extending JAX with custom C++ and CUDA code
Large-scale k-means and k-NN implementation on NVIDIA GPU / CUDA
Simple experimental async GPGPU framework for Rust
CUDA C++ Core Libraries
A simple profiler to count NVIDIA PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofl...
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writ...
Rust bindings to the NVIDIA NVBIT binary instrumentation API
SYCL-accelerated BLAKE3 hash implementation
Programmable CUDA/C++ GPU Graph Analytics
Weighted MinHash implementation on CUDA (multi-GPU)