llama.go

llama.go is like llama.cpp in pure Golang!

OTHER License

Stars
1.2K
Committers
3

Bot releases are hidden (Show)

llama.go - v1.4: Server Mode Latest Release

Published by gotzmann over 1 year ago

Introducing a Server Mode enabling easy-to-use REST API for the inner GPT model. Let's go Production :)

llama.go - Better Defaults

Published by gotzmann over 1 year ago

Nothing special, more stable inference and more sane default parameters

llama.go - AVX2 and NEON

Published by gotzmann over 1 year ago

Inference performance was boosted for CPUs supporting vector math.

Please use:

--neon flag for Apple Silicon (M1-M3 processors) and ARM servers

--avx for Intel and AMD CPUs which supports AVX2 instruction set

llama.go - Big Models are OK

Published by gotzmann over 1 year ago

This version supports bigger / multipart LLaMA models (tested with 7B / 13B) converted into latest GGMJ binary format with custom Python script (see README).

llama.go - April 12 - First Man in Space

Published by gotzmann over 1 year ago

The very first public release of LLaMA.go

Package Rankings
Top 4.46% on Proxy.golang.org
Related Projects