Gradient Descent Optimizers and Genetic Algorithms using GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI
GPL-3.0 License
Published by BrosnanYuen about 1 year ago
First Release
The fastest Tropical number matrix multiplication on GPU
Programmable CUDA/C++ GPU Graph Analytics
A Rust library integrated with ONNXRuntime, providing a collection of Computer Vision and Vision-L...
An architecture for LLMs' continual-learning and long-term memories
A simple yet sufficiently fast (attenuated) Radon and backproject implementation using KernelAbst...
Some CUDA design patterns and a bit of template magic for CUDA
3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
Sparse Boolean linear algebra for NVIDIA CUDA, OpenCL and CPU computations
CUDA C++ Core Libraries
A PyTorch-like dynamic computation graph and neural network framework implemented in NumPy (MLP, CNN, RNN, Transformer)
Classes enabling finmath-lib to run its Monte-Carlo models on CUDA GPUs
An unofficial Julia wrapper for the RAPIDS.ai ecosystem using PythonCall.jl
This repository lists some awesome public Rust projects, Videos, Blogs and Jobs.
Weighted MinHash implementation on CUDA (multi-GPU).