CLTune: An automatic OpenCL & CUDA kernel tuner
OTHER License
Statistics for this project are still being loaded, please check back later.
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being p...
3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!
Templated C++/CUDA implementation of Model Predictive Path Integral Control (MPPI)
A highly optimised C++ library for mathematical applications and neural networks.
Some CUDA design patterns and a bit of template magic for CUDA
An architecture for LLMs' continual-learning and long-term memories
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Programmable CUDA/C++ GPU Graph Analytics
Classes enabling finmath-lib to run its Monte-Carlo models on Cuda GPUs
SDK for GPU accelerated genome assembly and analysis
CUDA C++ Core Libraries
A KMeans implemented in C++ with Python bindings and GPU acceleration
Kernel Tuner
Abstraction Library for Parallel Kernel Acceleration