ILGPU JIT Compiler for high-performance .NET GPU programs
Simple yet fancy GPU architecture fetching tool
Achieve peak performance on x86 CPUs and NVIDIA GPUs
Tutorial on building a GPU compiler backend in LLVM
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
BQN virtual machine
An architecture for continual learning and long-term memory in LLMs
Some CUDA design patterns and a bit of template magic for CUDA
Matrix multiplication example performed with OpenMP, OpenACC, BLAS, cuBLAS, and CUDA
Abstraction Library for Parallel Kernel Acceleration
Simple experimental async GPGPU framework for Rust
Computer vision library with focus on heterogeneous systems
DCompute: Native execution of D on GPUs and other Accelerators
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being p...
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mi...