Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Statistics for this project are still being loaded, please check back later.
SDK for GPU accelerated genome assembly and analysis
Object detection for video surveillance
Provides an environment for compiling TensorFlow or PyTorch with CUDA for aarch64 on an x86 machi...
Provide Docker build sequences of Open3D for various environments.
Dockerfiles and manual for easy build of docker image with CUDA10.X and cuDNN7.6 to run TensorFlo...
Solvers/annealers for simulated quantum annealing on CPU and CUDA(NVIDIA GPU).
Deep Learning Docker Image
Templated C++/CUDA implementation of Model Predictive Path Integral Control (MPPI)
Provide Docker build sequences of PyTorch for various environments.
Matrix multiplication example performed with OpenMP, OpenACC, BLAS, cuBLABS, and CUDA
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being p...
CUDA C++ Core Libraries
Fast, reproducible, and portable software development environments
A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface
GPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV