An easy-to-use, high-performance, CUDA-powered LLM inference library.
BSD-3-Clause License
Published by jarroddavis68 5 months ago
The following CUDA and llama.cpp runtime DLLs must be present on your target device for Infero to function. Place them in the same directory as your Infero executable.
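As a quick deployment sanity check, a small shell function can verify that the required runtime DLLs sit next to the executable before you ship. The DLL names below are hypothetical placeholders, not Infero's actual list — substitute the names given above.

```shell
# check_dlls DIR DLL...
# Prints a status line per DLL and returns non-zero if any are missing.
check_dlls() {
  dir="$1"; shift
  missing=0
  for dll in "$@"; do
    if [ -f "$dir/$dll" ]; then
      echo "OK: $dll"
    else
      echo "MISSING: $dll"
      missing=1
    fi
  done
  return $missing
}

# Example usage with placeholder DLL names (replace with the real list):
check_dlls "." cudart64_12.dll llama.dll || echo "Some runtime DLLs are missing"
```

Running this from the directory containing your Infero executable makes missing runtime dependencies obvious before end users hit a load error.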