Weighted MinHash implementation on CUDA (multi-gpu).
OTHER License
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being p...
Unifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory
A general cubic equation solver and quartic equation minimisation solver written for CPU and Nvid...
Large scale K-means and K-nn implementation on NVIDIA GPU / CUDA
Some CUDA design patterns and a bit of template magic for CUDA
A CUDA Extension of Neural Network Libraries
Classes enabling finmath-lib to run its Monte-Carlo models on Cuda GPUs
CUDA C++ Core Libraries
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
SYCL accelerated BLAKE3 Hash Implementation
Safe rust wrapper around CUDA toolkit
Low-latency CUDA JPEG decoder by parallelizing Huffman decoding
Cuda-based matrix/vector computations
BQN virtual machine
Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mi...