A high-performance inference system for large language models, designed for production environments.
APACHE-2.0 License
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
Sparse Boolean linear algebra for Nvidia Cuda, OpenCL and CPU computations
CUDA C++ Core Libraries
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with...
LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!
ezlocalai is an easy to set up local artificial intelligence server with OpenAI Style Endpoints.
An architecture for LLMs' continual-learning and long-term memories
Cross-platform, customizable multimedia/video processing framework. With strong GPU acceleration...
3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!
Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package wi...
A Rust library integrated with ONNXRuntime, providing a collection of Computer Vison and Vision-L...
This repository lists some awesome public Rust projects, Videos, Blogs and Jobs.
An easy to use, high performant CUDA powered LLM inference library.
Object detection for video surveillance
A git repository containing an NLP example using DL4J (cuda) in Java