Matrix multiplication example performed with OpenMP, OpenACC, BLAS, cuBLABS, and CUDA
BSD-3-CLAUSE License
Statistics for this project are still being loaded, please check back later.
Fast, reproducible, and portable software development environments
GPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
BQN virtual machine
Ubuntu PyTorch CUDA Docker image with KDE Plasma Desktop & VNC. Ideal for LLM & Deep Learning rem...
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being p...
A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface
SDK for GPU accelerated genome assembly and analysis
Provide Docker build sequences of Open3D for various environments.
Object detection for video surveillance
Provide Docker build sequences of PyTorch for various environments.
RealSense execution environment built on a Docker container on Ubuntu 20.04. NIVIDA GPU and OpenG...
Dockerfiles and manual for easy build of docker image with CUDA10.X and cuDNN7.6 to run TensorFlo...
Deep Learning Docker Image
Provides an environment for compiling TensorFlow or PyTorch with CUDA for aarch64 on an x86 machi...