A PyTorch Library for Accelerating 3D Deep Learning Research
VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications
🎉 Modern CUDA Learn Notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm
A brian2 extension to simulate spiking neural networks on GPUs
SDK for GPU accelerated genome assembly and analysis
A collection of GICP-based fast point cloud registration algorithms
A highly optimised C++ library for mathematical applications and neural networks