Best practices & guides on how to write distributed pytorch training code
A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface
Glasses detection, classification and segmentation
🎉 Modern CUDA Learn Notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm
Instant-ngp in pytorch+cuda trained with pytorch-lightning (high quality with high speed, with only few lines of legible code)
Yolov5 Object Detection In OSRS using Python code, Detecting Cows - Botting
Provides an environment for compiling TensorFlow or PyTorch with CUDA for aarch64 on an x86 machine
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being properly picked up
Install PyTorch distributions with computation backend auto-detection
A PyTorch Library for Accelerating 3D Deep Learning Research