The fastest Tropical number matrix multiplication on GPU
MIT License
Some CUDA design patterns and a bit of template magic for CUDA
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being p...
A simple yet sufficiently fast (attenuated) Radon and backproject implementation using KernelAbst...
Abstraction Library for Parallel Kernel Acceleration
Sparse Boolean linear algebra for Nvidia Cuda, OpenCL and CPU computations
3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!
Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package wi...
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
BQN virtual machine
Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mi...
CUDA C++ Core Libraries
A highly optimised C++ library for mathematical applications and neural networks.
A collection of GICP-based fast point cloud registration algorithms
Classes enabling finmath-lib to run its Monte-Carlo models on Cuda GPUs
Cuda-based matrix/vector computations