Canny edge detector implemented in CUDA C/C++
Templated C++/CUDA implementation of Model Predictive Path Integral Control (MPPI)
🎉 Modern CUDA Learn Notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm
A general cubic equation solver and quartic equation minimisation solver written for CPU and Nvidia GPUs, for more details and results, see: https://arxiv
SDK for GPU accelerated genome assembly and analysis
Ising: a Python package for exactly solving abritrary Ising model instances using exhaustive search
In this code is provided a simple, efficient and fast method to calculate motion and backgroud dynamically using nVidia GPUs power