Autotuning NVCC Compiler Parameters, published @ CCPE Journal
LGPL-3.0 License
Kernel Tuner
A CUDA Extension of Neural Network Libraries
HPC solver for nonlinear optimization problems
Playing with CUDA and GPUs in Google Colab
Provides an environment for compiling TensorFlow or PyTorch with CUDA for aarch64 on an x86 machi...
Par4All is an automatic parallelizing and optimizing compiler (workbench) for C and Fortran seque...
cuda编程学习入门
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Some CUDA design patterns and a bit of template magic for CUDA
Python library for fast time-series analysis on CUDA GPUs
SDK for GPU accelerated genome assembly and analysis
CLTune: An automatic OpenCL & CUDA kernel tuner
Templated C++/CUDA implementation of Model Predictive Path Integral Control (MPPI)
GPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV
Classes enabling finmath-lib to run its Monte-Carlo models on Cuda GPUs