A Brian2 extension to simulate spiking neural networks on GPUs
SDK for GPU-accelerated genome assembly and analysis
A collection of GICP-based fast point cloud registration algorithms
A highly optimised C++ library for mathematical applications and neural networks
A data-parallel functional programming language
A high-performance inference system for large language models, designed for production environments
Unifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
ThunderGBM: Fast GBDTs and Random Forests on GPUs
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
nvImageCodec: a library of GPU- and CPU-accelerated codecs featuring a unified interface