Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)
BSD-3-CLAUSE License
An architecture for LLMs' continual-learning and long-term memories
Instant-ngp in pytorch+cuda trained with pytorch-lightning (high quality with high speed, with on...
3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
Fast, reproducible, and portable software development environments
Matrix multiplication example performed with OpenMP, OpenACC, BLAS, cuBLABS, and CUDA
Real-time large scale dense visual SLAM system
CUDA C++ Core Libraries
Object detection for video surveillance
A high-performance inference system for large language models, designed for production environments.
SDK for GPU accelerated genome assembly and analysis
A git repository containing an NLP example using DL4J (cuda) in Java
The fastest way to compute matrix profiles on CPU and GPU!
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being p...
Deep Learning Docker Image