Bot releases are hidden (Show)

LocalAI - v2.17.1 Latest Release

Published by mudler 4 months ago

Highlights

This is a patch release to address issues with Linux single binary releases. It also adds support for Stable diffusion 3!

Stable diffusion 3

You can use Stable diffusion 3 by installing the model in the gallery (stable-diffusion-3-medium) or by placing this YAML file in the model folder:

backend: diffusers
diffusers:
  cuda: true
  enable_parameters: negative_prompt,num_inference_steps
  pipeline_type: StableDiffusion3Pipeline
f16: false
name: sd3
parameters:
  model: v2ray/stable-diffusion-3-medium-diffusers
step: 25

You can try then generating an image:

http://localhost:9091/v1/images/generations -H "Content-Type: application/json" -d '{
  "prompt": "A cute baby sea otter", "model": "sd3"
}

Example result:

b64514236520

What's Changed

Bug fixes 🐛

fix(single-binary): bundle ld.so by @mudler in https://github.com/mudler/LocalAI/pull/2602

Exciting New Features 🎉

feat(sd-3): add stablediffusion 3 support by @mudler in https://github.com/mudler/LocalAI/pull/2591
feat(talk): display an informative box, better colors by @mudler in https://github.com/mudler/LocalAI/pull/2600

📖 Documentation and examples

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/2593

👒 Dependencies

⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2594

Other Changes

⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2603

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.17.0...v2.17.1

LocalAI - v2.17.0

Published by mudler 4 months ago

local-ai-release-2 17-shadow
Ahoj! this new release of LocalAI comes with tons of updates, and enhancements behind the scenes!

🌟 Highlights TLDR;

Automatic identification of GGUF models
New WebUI page to talk with an LLM!
https://models.localai.io is live! 🚀
Better arm64 and Apple silicon support
More models to the gallery!
New quickstart installer script
Enhancements to mixed grammar support
Major improvements to transformers
Linux single binary now supports rocm, nvidia, and intel

🤖 Automatic model identification for llama.cpp-based models

Just drop your GGUF files into the model folders, and let LocalAI handle the configurations. YAML files are now reserved for those who love to tinker with advanced setups.

🔊 Talk to your LLM!

Introduced a new page that allows direct interaction with the LLM using audio transcription and TTS capabilities. This feature is so fun - now you can just talk with any LLM with a couple of clicks away.
Screenshot from 2024-06-08 12-44-41

🍏 Apple single-binary

Experience enhanced support for the Apple ecosystem with a comprehensive single-binary that packs all necessary libraries, ensuring LocalAI runs smoothly on MacOS and ARM64 architectures.

ARM64

Expanded our support for ARM64 with new Docker images and single binary options, ensuring better compatibility and performance on ARM-based systems.

Note: currently we support only arm core images, for instance: localai/localai:master-ffmpeg-core, localai/localai:latest-ffmpeg-core, localai/localai:v2.17.0-ffmpeg-core.

🐞 Bug Fixes and small enhancements

We’ve ironed out several issues, including image endpoint response types and other minor problems, boosting the stability and reliability of our applications. It is now also possible to enable CSRF when starting LocalAI, thanks to @dave-gray101.

🌐 Models and Galleries

Enhanced the model gallery with new additions like Mirai Nova, Mahou, and several updates to existing models ensuring better performance and accuracy.

Now you can check new models also in https://models.localai.io, without running LocalAI!

Installation and Setup

A new install.sh script is now available for quick and hassle-free installations, streamlining the setup process for new users.

curl https://localai.io/install.sh | sh

Installation can be configured with Environment variables, for example:

curl https://localai.io/install.sh | VAR=value sh

List of the Environment Variables:

DOCKER_INSTALL: Set to "true" to enable the installation of Docker images.
USE_AIO: Set to "true" to use the all-in-one LocalAI Docker image.
API_KEY: Specify an API key for accessing LocalAI, if required.
CORE_IMAGES: Set to "true" to download core LocalAI images.
PORT: Specifies the port on which LocalAI will run (default is 8080).
THREADS: Number of processor threads the application should use. Defaults to the number of logical cores minus one.
VERSION: Specifies the version of LocalAI to install. Defaults to the latest available version.
MODELS_PATH: Directory path where LocalAI models are stored (default is /usr/share/local-ai/models).

We are looking into improving the installer, and as this is a first iteration any feedback is welcome! Open up an issue if something doesn't work for you!

Enhancements to mixed grammar support

Mixed grammar support continues receiving improvements behind the scenes.

🐍 Transformers backend enhancements

Temperature = 0 correctly handled as greedy search
Handles custom words as stop words
Implement KV cache
Phi 3 no more requires trust_remote_code: true flag

Shout-out to @fakezeta for these enhancements!

Install models with the CLI

Now the CLI can install models directly from the gallery. For instance:

local-ai run <model_name_in gallery>

This command ensures the model is installed in the model folder at startup.

🐧 Linux single binary now supports rocm, nvidia, and intel

Single binaries for Linux now contain Intel, AMD GPU, and NVIDIA support. Note that you need to install the dependencies separately in the system to leverage these features. In upcoming releases, this requirement will be handled by the installer script.

📣 Let's Make Some Noise!

A gigantic THANK YOU to everyone who’s contributed—your feedback, bug squashing, and feature suggestions are what make LocalAI shine. To all our heroes out there supporting other users and sharing their expertise, you’re the real MVPs!

Remember, LocalAI thrives on community support—not big corporate bucks. If you love what we're building, show some love! A shoutout on social (@LocalAI_OSS and @mudler_it on twitter/X), joining our sponsors, or simply starring us on GitHub makes all the difference.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Thanks a ton, and.. enjoy this release!

What's Changed

Bug fixes 🐛

fix: gpu fetch device info by @sozercan in https://github.com/mudler/LocalAI/pull/2403
fix(watcher): do not emit fatal errors by @mudler in https://github.com/mudler/LocalAI/pull/2410
fix: install pytorch from proper index for hipblas builds by @cryptk in https://github.com/mudler/LocalAI/pull/2413
fix: pin version of setuptools for intel builds to work around #2406 by @cryptk in https://github.com/mudler/LocalAI/pull/2414
bugfix: CUDA acceleration not working by @fakezeta in https://github.com/mudler/LocalAI/pull/2475
fix: pkg/downloader should respect basePath for file:// urls by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2481
fix: chat webui response parsing by @sozercan in https://github.com/mudler/LocalAI/pull/2515
fix(stream): do not break channel consumption by @mudler in https://github.com/mudler/LocalAI/pull/2517
fix(Makefile): enable STATIC on dist by @mudler in https://github.com/mudler/LocalAI/pull/2569

Exciting New Features 🎉

feat(images): do not install python deps in the core image by @mudler in https://github.com/mudler/LocalAI/pull/2425
feat(hipblas): extend default hipblas GPU_TARGETS by @mudler in https://github.com/mudler/LocalAI/pull/2426
feat(build): add arm64 core containers by @mudler in https://github.com/mudler/LocalAI/pull/2421
feat(functions): allow parallel calls with mixed/no grammars by @mudler in https://github.com/mudler/LocalAI/pull/2432
feat(image): support response_type in the OpenAI API request by @prajwalnayak7 in https://github.com/mudler/LocalAI/pull/2347
feat(swagger): update swagger by @localai-bot in https://github.com/mudler/LocalAI/pull/2436
feat(functions): better free string matching, allow to expect strings after JSON by @mudler in https://github.com/mudler/LocalAI/pull/2445
build(Makefile): add back single target to build native llama-cpp by @mudler in https://github.com/mudler/LocalAI/pull/2448
feat(functions): allow response_regex to be a list by @mudler in https://github.com/mudler/LocalAI/pull/2447
TTS API improvements by @blob42 in https://github.com/mudler/LocalAI/pull/2308
feat(transformers): various enhancements to the transformers backend by @fakezeta in https://github.com/mudler/LocalAI/pull/2468
feat(webui): enhance card visibility by @mudler in https://github.com/mudler/LocalAI/pull/2473
feat(default): use number of physical cores as default by @mudler in https://github.com/mudler/LocalAI/pull/2483
feat: fiber CSRF by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2482
feat(amdgpu): try to build in single binary by @mudler in https://github.com/mudler/LocalAI/pull/2485
feat:OpaqueErrors to hide error information by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2486
build(intel): bundle intel variants in single-binary by @mudler in https://github.com/mudler/LocalAI/pull/2494
feat(install): add install.sh for quick installs by @mudler in https://github.com/mudler/LocalAI/pull/2489
feat(llama.cpp): guess model defaults from file by @mudler in https://github.com/mudler/LocalAI/pull/2522
feat(ui): add page to talk with voice, transcription, and tts by @mudler in https://github.com/mudler/LocalAI/pull/2520
feat(arm64): enable single-binary builds by @mudler in https://github.com/mudler/LocalAI/pull/2490
feat(util): add util command to print GGUF informations by @mudler in https://github.com/mudler/LocalAI/pull/2528
feat(defaults): add defaults for Command-R models by @mudler in https://github.com/mudler/LocalAI/pull/2529
feat(detection): detect by template in gguf file, add qwen2, phi, mistral and chatml by @mudler in https://github.com/mudler/LocalAI/pull/2536
feat(gallery): show available models in website, allow local-ai models install to install from galleries by @mudler in https://github.com/mudler/LocalAI/pull/2555
feat(gallery): uniform download from CLI by @mudler in https://github.com/mudler/LocalAI/pull/2559
feat(guesser): identify gemma models by @mudler in https://github.com/mudler/LocalAI/pull/2561
feat(binary): support extracted bundled libs on darwin by @mudler in https://github.com/mudler/LocalAI/pull/2563
feat(darwin): embed grpc libs by @mudler in https://github.com/mudler/LocalAI/pull/2567
feat(build): bundle libs for arm64 and x86 linux binaries by @mudler in https://github.com/mudler/LocalAI/pull/2572
feat(libpath): refactor and expose functions for external library paths by @mudler in https://github.com/mudler/LocalAI/pull/2578

🧠 Models

models(gallery): add Mirai Nova by @mudler in https://github.com/mudler/LocalAI/pull/2405
models(gallery): add Mahou by @mudler in https://github.com/mudler/LocalAI/pull/2411
models(gallery): add minicpm by @mudler in https://github.com/mudler/LocalAI/pull/2412
models(gallery): add poppy porpoise 0.85 by @mudler in https://github.com/mudler/LocalAI/pull/2415
models(gallery): add alpha centauri by @mudler in https://github.com/mudler/LocalAI/pull/2416
models(gallery): add cream-phi-13b by @mudler in https://github.com/mudler/LocalAI/pull/2417
models(gallery): add stheno-mahou by @mudler in https://github.com/mudler/LocalAI/pull/2418
models(gallery): add iterative-dpo, fix minicpm by @mudler in https://github.com/mudler/LocalAI/pull/2422
models(gallery): add una-thepitbull by @mudler in https://github.com/mudler/LocalAI/pull/2435
models(gallery): add halu by @mudler in https://github.com/mudler/LocalAI/pull/2434
models(gallery): add neuraldaredevil by @mudler in https://github.com/mudler/LocalAI/pull/2439
models(gallery): add Codestral by @mudler in https://github.com/mudler/LocalAI/pull/2442
models(gallery): add mopeymule by @mudler in https://github.com/mudler/LocalAI/pull/2449
models(gallery): ⬆️ update checksum by @localai-bot in https://github.com/mudler/LocalAI/pull/2451
models(gallery): add anjir by @mudler in https://github.com/mudler/LocalAI/pull/2454
models(gallery): add llama3-11b by @mudler in https://github.com/mudler/LocalAI/pull/2455
models(gallery): add ultron by @mudler in https://github.com/mudler/LocalAI/pull/2456
models(gallery): add poppy porpoise 1.0 by @mudler in https://github.com/mudler/LocalAI/pull/2459
models(gallery): add Neural SOVLish Devil by @mudler in https://github.com/mudler/LocalAI/pull/2460
models(gallery): add all whisper variants by @mudler in https://github.com/mudler/LocalAI/pull/2462
models(gallery): ⬆️ update checksum by @localai-bot in https://github.com/mudler/LocalAI/pull/2463
models(gallery): add gemma-2b by @mudler in https://github.com/mudler/LocalAI/pull/2466
models(gallery): add fimbulvetr iqmatrix version by @mudler in https://github.com/mudler/LocalAI/pull/2470
models(gallery): add new poppy porpoise versions by @mudler in https://github.com/mudler/LocalAI/pull/2471
models(gallery): add dolphin-2.9.2-Phi-3-Medium by @mudler in https://github.com/mudler/LocalAI/pull/2492
models(gallery): add dolphin-2.9.2-phi-3-Medium-abliterated by @mudler in https://github.com/mudler/LocalAI/pull/2495
models(gallery): add nyun by @mudler in https://github.com/mudler/LocalAI/pull/2496
models(gallery): add phi-3-4x4b by @mudler in https://github.com/mudler/LocalAI/pull/2497
models(gallery): add llama-3-instruct-8b-SimPO-ExPO by @mudler in https://github.com/mudler/LocalAI/pull/2498
models(gallery): add Llama-3-Yggdrasil-2.0-8B by @mudler in https://github.com/mudler/LocalAI/pull/2499
models(gallery): add l3-8b-stheno-v3.2-iq-imatrix by @mudler in https://github.com/mudler/LocalAI/pull/2500
models(gallery): add llama3-8B-aifeifei-1.0-iq-imatrix by @mudler in https://github.com/mudler/LocalAI/pull/2509
models(gallery): add rawr_llama3_8b-iq-imatrix by @mudler in https://github.com/mudler/LocalAI/pull/2510
models(gallery): add llama3-8b-feifei-1.0-iq-imatrix by @mudler in https://github.com/mudler/LocalAI/pull/2511
models(gallery): ⬆️ update checksum by @localai-bot in https://github.com/mudler/LocalAI/pull/2519
models(gallery): add llama3-8B-aifeifei-1.2-iq-imatrix by @mudler in https://github.com/mudler/LocalAI/pull/2544
models(gallery): add hathor-l3-8b-v.01-iq-imatrix by @mudler in https://github.com/mudler/LocalAI/pull/2545
models(gallery): add l3-aethora-15b by @mudler in https://github.com/mudler/LocalAI/pull/2546
models(gallery): add llama-salad-8x8b by @mudler in https://github.com/mudler/LocalAI/pull/2547
models(gallery): add average_normie_v3.69_8b-iq-imatrix by @mudler in https://github.com/mudler/LocalAI/pull/2548
models(gallery): add duloxetine by @mudler in https://github.com/mudler/LocalAI/pull/2549
models(gallery): add badger-lambda-llama-3-8b by @mudler in https://github.com/mudler/LocalAI/pull/2550
models(gallery): add firefly-gemma-7b by @mudler in https://github.com/mudler/LocalAI/pull/2576
models(gallery): add dolphin-qwen by @mudler in https://github.com/mudler/LocalAI/pull/2580
models(gallery): add tess-v2.5-phi-3-medium-128k-14b by @mudler in https://github.com/mudler/LocalAI/pull/2581
models(gallery): add hathor_stable-v0.2-l3-8b by @mudler in https://github.com/mudler/LocalAI/pull/2582
models(gallery): add samantha-qwen2 by @mudler in https://github.com/mudler/LocalAI/pull/2586
models(gallery): add gemma-1.1-7b-it by @mudler in https://github.com/mudler/LocalAI/pull/2588

📖 Documentation and examples

Update quickstart.md by @mudler in https://github.com/mudler/LocalAI/pull/2404
docs: fix p2p commands by @mudler in https://github.com/mudler/LocalAI/pull/2472
README: update sponsors list by @mudler in https://github.com/mudler/LocalAI/pull/2476
Add integrations by @reid41 in https://github.com/mudler/LocalAI/pull/2535
docs(gallery): lazy-load images by @mudler in https://github.com/mudler/LocalAI/pull/2557
Fix standard image latest Docker tags by @nwithan8 in https://github.com/mudler/LocalAI/pull/2574

👒 Dependencies

⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2399
⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/2398
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2408
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2409
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2419
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2427
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2428
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2433
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2437
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2438
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2444
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2443
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2452
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2453
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2465
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2467
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2477
toil: bump grpc version by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2480
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2487
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2493
deps(whisper): update, add libcufft-dev by @mudler in https://github.com/mudler/LocalAI/pull/2501
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2507
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2508
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2518
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2524
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2531
chore(deps): Update Dockerfile by @reneleonhardt in https://github.com/mudler/LocalAI/pull/2532
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2539
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2540
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2552
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2551
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2554
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2564
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2565
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2570
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2575
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2584
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2587

Other Changes

ci: fix sd release by @sozercan in https://github.com/mudler/LocalAI/pull/2400
ci(grpc-cache): also arm64 by @mudler in https://github.com/mudler/LocalAI/pull/2423
ci: push test images when building PRs by @mudler in https://github.com/mudler/LocalAI/pull/2424
ci: pin build-time protoc by @mudler in https://github.com/mudler/LocalAI/pull/2461
feat(swagger): update swagger by @localai-bot in https://github.com/mudler/LocalAI/pull/2464
ci: run release build on self-hosted runners by @mudler in https://github.com/mudler/LocalAI/pull/2505
experiment: -j4 for build-linux: by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2514
test: e2e /reranker endpoint by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2211
ci: pack less libs inside the binary by @mudler in https://github.com/mudler/LocalAI/pull/2579

New Contributors

@prajwalnayak7 made their first contribution in https://github.com/mudler/LocalAI/pull/2347
@reneleonhardt made their first contribution in https://github.com/mudler/LocalAI/pull/2532
@reid41 made their first contribution in https://github.com/mudler/LocalAI/pull/2535
@nwithan8 made their first contribution in https://github.com/mudler/LocalAI/pull/2574

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.16.0...v2.17.0

LocalAI - v2.16.0

Published by mudler 5 months ago

local-ai-release-2 16

Welcome to LocalAI's latest update!

🎉🎉🎉 woot woot! So excited to share this release, a lot of new features are landing in LocalAI!!!!! 🎉🎉🎉

🌟 Introducing Distributed Llama.cpp Inferencing

Now it is possible to distribute the inferencing workload across different workers with llama.cpp models !

This feature has landed with https://github.com/mudler/LocalAI/pull/2324 and is based on the upstream work of @rgerganov in https://github.com/ggerganov/llama.cpp/pull/6829.

How it works: a front-end server manages the requests compatible with the OpenAI API (LocalAI) and workers (llama.cpp) are used to distribute the workload. This makes possible to run larger models split across different nodes!

How to use it

To start workers to offload the computation you can run:

local-ai llamacpp-worker <listening_address> <listening_port>

However, you can also follow the llama.cpp README and building the rpc-server (https://github.com/ggerganov/llama.cpp/blob/master/examples/rpc/README.md), which is still compatible with LocalAI.

When starting the LocalAI server, which is going to accept the API requests, you can set a list of workers IP/address by specifying the addresses with LLAMACPP_GRPC_SERVERS:

LLAMACPP_GRPC_SERVERS="address1:port,address2:port" local-ai run

At this point the workload hitting in the LocalAI server should be distributed across the nodes!

🤖 Peer2Peer llama.cpp

LocalAI is the first AI Free, Open source project offering complete, decentralized, peer2peer while private, LLM inferencing on top of the libp2p protocol. There is no "public swarm" to offload the computation, but rather empowers you to build your own cluster of local and remote machines to distribute LLM computation.

This feature leverages the ability of llama.cpp to distribute the workload explained just above and features from one of my other projects, https://github.com/mudler/edgevpn.

LocalAI builds on top of the twos, and allows to create a private peer2peer network between nodes, without the need of centralizing connections or manually configuring IP addresses: it unlocks totally decentralized, private, peer-to-peer inferencing capabilities. Works also behind different NAT-ted networks (uses DHT and mDNS as discovery mechanism).

How it works: A pre-shared token can be generated and shared between workers and the server to form a private, decentralized, p2p network.

You can see the feature in action here:

output

How to use it

Start the server with --p2p:

./local-ai run --p2p
# 1:02AM INF loading environment variables from file envFile=.env
# 1:02AM INF Setting logging to info
# 1:02AM INF P2P mode enabled
# 1:02AM INF No token provided, generating one
# 1:02AM INF Generated Token:
# XXXXXXXXXXX
# 1:02AM INF Press a button to proceed

A token is displayed, copy it and press enter.

You can re-use the same token later restarting the server with --p2ptoken (or P2P_TOKEN).

Start the workers. Now you can copy the local-ai binary in other hosts, and run as many workers with that token:

TOKEN=XXX ./local-ai  p2p-llama-cpp-rpc
# 1:06AM INF loading environment variables from file envFile=.env
# 1:06AM INF Setting logging to info
# {"level":"INFO","time":"2024-05-19T01:06:01.794+0200","caller":"config/config.go:288","message":"connmanager disabled\n"}
# {"level":"INFO","time":"2024-05-19T01:06:01.794+0200","caller":"config/config.go:295","message":" go-libp2p resource manager protection enabled"}
# {"level":"INFO","time":"2024-05-19T01:06:01.794+0200","caller":"config/config.go:409","message":"max connections: 100\n"}
# 1:06AM INF Starting llama-cpp-rpc-server on '127.0.0.1:34371'
# {"level":"INFO","time":"2024-05-19T01:06:01.794+0200","caller":"node/node.go:118","message":" Starting EdgeVPN network"}
# create_backend: using CPU backend
# Starting RPC server on 127.0.0.1:34371, backend memory: 31913 MB
# 2024/05/19 01:06:01 failed to sufficiently increase receive buffer size (was: 208 kiB, wanted: 2048 kiB, got: 416 kiB). # See https://github.com/quic-go/quic-go/wiki/UDP-Buffer-Sizes for details.
# {"level":"INFO","time":"2024-05-19T01:06:01.805+0200","caller":"node/node.go:172","message":" Node ID: 12D3KooWJ7WQAbCWKfJgjw2oMMGGss9diw3Sov5hVWi8t4DMgx92"}
# {"level":"INFO","time":"2024-05-19T01:06:01.806+0200","caller":"node/node.go:173","message":" Node Addresses: [/ip4/127.0.0.1/tcp/44931 /ip4/127.0.0.1/udp/33251/quic-v1/webtransport/certhash/uEiAWAhZ-W9yx2ZHnKQm3BE_ft5jjoc468z5-Rgr9XdfjeQ/certhash/uEiB8Uwn0M2TQBELaV2m4lqypIAY2S-2ZMf7lt_N5LS6ojw /ip4/127.0.0.1/udp/35660/quic-v1 /ip4/192.168.68.110/tcp/44931 /ip4/192.168.68.110/udp/33251/quic-v1/webtransport/certhash/uEiAWAhZ-W9yx2ZHnKQm3BE_ft5jjoc468z5-Rgr9XdfjeQ/certhash/uEiB8Uwn0M2TQBELaV2m4lqypIAY2S-2ZMf7lt_N5LS6ojw /ip4/192.168.68.110/udp/35660/quic-v1 /ip6/::1/tcp/41289 /ip6/::1/udp/33160/quic-v1/webtransport/certhash/uEiAWAhZ-W9yx2ZHnKQm3BE_ft5jjoc468z5-Rgr9XdfjeQ/certhash/uEiB8Uwn0M2TQBELaV2m4lqypIAY2S-2ZMf7lt_N5LS6ojw /ip6/::1/udp/35701/quic-v1]"}
# {"level":"INFO","time":"2024-05-19T01:06:01.806+0200","caller":"discovery/dht.go:104","message":" Bootstrapping DHT"}

(Note you can also supply the token via args)

At this point, you should see in the server logs messages stating that new workers are found

Now you can start doing inference as usual on the server (the node used on step 1)

Interested in to try it out? As we are still updating the documentation, you can read the full instructions here https://github.com/mudler/LocalAI/pull/2343

📜 Advanced Function calling support with Mixed JSON Grammars

LocalAI gets better at function calling with mixed grammars!

With this release, LocalAI introduces a transformative capability: support for mixed JSON BNF grammars. It allows to specify a grammar for the LLM that allows to output structured JSON and free text.

How to use it:

To enable mixed grammars, you can set in the YAML configuration file function.mixed_mode = true, for example:

  function:
    # disable injecting the "answer" tool
    disable_no_action: true

    grammar:
      # This allows the grammar to also return messages
      mixed_mode: true

This feature significantly enhances LocalAI's ability to interpret and manipulate JSON data coming from the LLM through a more flexible and powerful grammar system. Users can now combine multiple grammar types within a single JSON structure, allowing for dynamic parsing and validation scenarios.

Grammars can also turned off entirely and leave the user to determine how the data is parsed from the LLM to be correctly interpretated by LocalAI to be still compliant to the OpenAI REST spec.

For example, to interpret Hermes results, one can just annotate regexes in function.json_regex_match to extract the LLM response:

  function:
    grammar:
      disable: true
    # disable injecting the "answer" tool
    disable_no_action: true
    return_name_in_function_response: true

    json_regex_match:
    - "(?s)<tool_call>(.*?)</tool_call>"
    - "(?s)<tool_call>(.*?)"
  
    replace_llm_results:
    # Drop the scratchpad content from responses
    - key: "(?s)<scratchpad>.*</scratchpad>"
      value: ""
    replace_function_results:
    # Replace everything that is not JSON array or object, just in case.
    - key: '(?s)^[^{\[]*'
      value: ""
    - key: '(?s)[^}\]]*$'
      value: ""
    # Drop the scratchpad content from responses
    - key: "(?s)<scratchpad>.*</scratchpad>"
      value: ""

Note that regex can still be used when enabling mixed grammars is enabled.

This is especially important for models which does not support grammars - such as transformers or OpenVINO models, that now can support as well function calling. As we update the docs, further documentation can be found in the PRs that you can find in the changelog below.

🚀 New Model Additions and Updates

local-ai-yi-updates

Our model gallery continues to grow with exciting new additions like Aya-35b, Mistral-0.3, Hermes-Theta and updates to existing models ensuring they remain at the cutting edge.

This release is having major enhancements on tool calling support. Besides working on making our default models in AIO images more performant - now you can try an enhanced out-of-the-box experience with function calling in the Hermes model family ( Hermes-2-Pro-Mistral and Hermes-2-Theta-Llama-3)

Our LocalAI function model!

local-ai-functioncall-model

I have fine-tuned a function call model specific to leverage entirely the grammar support of LocalAI, you can find it in the model gallery already and on huggingface

🔄 Single Binary Release: Simplified Deployment and Management

In our continuous effort to streamline the user experience and deployment process, LocalAI v2.16.0 proudly introduces a single binary release. This enhancement, thanks to @sozercan's contributions, consolidates all variants (CUDA and non-cuda releases) and dependencies into one compact executable file.

This change simplifies the installation and update processes, reduces compatibility issues, and speeds up the setup for new users and existing deployments as now binary releases are even more portable than ever!

🔧 Bug Fixes and Improvements

A host of bug fixes have been implemented to ensure smoother operation and integration. Key fixes include enhancements to the Intel build process, stability adjustments for setuptools in Python backends, and critical updates ensuring the successful build of p2p configurations.

Migrating Python Backends: From Conda to UV

LocalAI has migrated its Python backends from Conda to UV. This transition, thanks to @cryptk contributions, enhances the efficiency and scalability of our backend operations. Users will experience faster setup times and reduced complexity, streamlining the development process and making it easier to manage dependencies across different environments.

📣 Let's Make Some Noise!

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Thanks a ton, and.. enjoy this release!

What's Changed

Bug fixes 🐛

build: do not specify a BUILD_ID by default by @mudler in https://github.com/mudler/LocalAI/pull/2284
fix: add missing openvino/optimum/etc libraries for Intel, fixes #2289 by @cryptk in https://github.com/mudler/LocalAI/pull/2292
add setuptools for openvino by @fakezeta in https://github.com/mudler/LocalAI/pull/2301
fix: add setuptools to all requirements-intel.txt files for python backends by @cryptk in https://github.com/mudler/LocalAI/pull/2333
ci: correctly build p2p in GO_TAGS by @mudler in https://github.com/mudler/LocalAI/pull/2369
ci: generate specific image for intel builds by @mudler in https://github.com/mudler/LocalAI/pull/2374
fix: stablediffusion binary by @sozercan in https://github.com/mudler/LocalAI/pull/2385

Exciting New Features 🎉

feat: migrate python backends from conda to uv by @cryptk in https://github.com/mudler/LocalAI/pull/2215
feat: create bash library to handle install/run/test of python backends by @cryptk in https://github.com/mudler/LocalAI/pull/2286
feat(grammar): support models with specific construct by @mudler in https://github.com/mudler/LocalAI/pull/2291
feat(ui): display number of available models for installation by @mudler in https://github.com/mudler/LocalAI/pull/2298
feat: auto select llama-cpp cpu variant by @sozercan in https://github.com/mudler/LocalAI/pull/2305
feat(llama.cpp): add flash_attention and no_kv_offloading by @mudler in https://github.com/mudler/LocalAI/pull/2310
feat(functions): support models with no grammar and no regex by @mudler in https://github.com/mudler/LocalAI/pull/2315
feat(functions): allow to set JSON matcher by @mudler in https://github.com/mudler/LocalAI/pull/2319
feat: auto select llama-cpp cuda runtime by @sozercan in https://github.com/mudler/LocalAI/pull/2306
feat(llama.cpp): add distributed llama.cpp inferencing by @mudler in https://github.com/mudler/LocalAI/pull/2324
feat(functions): mixed JSON BNF grammars by @mudler in https://github.com/mudler/LocalAI/pull/2328
feat(functions): simplify parsing, read functions as list by @mudler in https://github.com/mudler/LocalAI/pull/2340
feat(functions): Enable true regex replacement for the regexReplacement option by @lenaxia in https://github.com/mudler/LocalAI/pull/2341
feat(backends): add openvoice backend by @mudler in https://github.com/mudler/LocalAI/pull/2334
feat(webui): statically embed js/css assets by @mudler in https://github.com/mudler/LocalAI/pull/2348
feat(functions): allow to use JSONRegexMatch unconditionally by @mudler in https://github.com/mudler/LocalAI/pull/2349
feat(functions): don't use yaml.MapSlice by @mudler in https://github.com/mudler/LocalAI/pull/2354
build: add sha by @mudler in https://github.com/mudler/LocalAI/pull/2356
feat(llama.cpp): Totally decentralized, private, distributed, p2p inference by @mudler in https://github.com/mudler/LocalAI/pull/2343
feat(functions): relax mixedgrammars by @mudler in https://github.com/mudler/LocalAI/pull/2365
models(gallery): add mistral-0.3 and command-r, update functions by @mudler in https://github.com/mudler/LocalAI/pull/2388

🧠 Models

models(gallery): add aloe by @mudler in https://github.com/mudler/LocalAI/pull/2283
models(gallery): add Llama-3-8B-Instruct-abliterated by @mudler in https://github.com/mudler/LocalAI/pull/2288
models(gallery): add l3-chaoticsoliloquy-v1.5-4x8b by @mudler in https://github.com/mudler/LocalAI/pull/2295
models(gallery): add jsl-medllama-3-8b-v2.0 by @mudler in https://github.com/mudler/LocalAI/pull/2296
models(gallery): add llama-3-refueled by @mudler in https://github.com/mudler/LocalAI/pull/2297
models(gallery): add aura-llama-Abliterated by @mudler in https://github.com/mudler/LocalAI/pull/2309
models(gallery): add Bunny-llama by @mudler in https://github.com/mudler/LocalAI/pull/2311
models(gallery): add lumimaidv2 by @mudler in https://github.com/mudler/LocalAI/pull/2312
models(gallery): add orthocopter by @mudler in https://github.com/mudler/LocalAI/pull/2313
fix(gallery) Correct llama3-8b-instruct model file by @tannisroot in https://github.com/mudler/LocalAI/pull/2330
models(gallery): add hermes-2-theta-llama-3-8b by @mudler in https://github.com/mudler/LocalAI/pull/2331
models(gallery): add yi 6/9b, sqlcoder, sfr-iterative-dpo by @mudler in https://github.com/mudler/LocalAI/pull/2335
models(gallery): add anita by @mudler in https://github.com/mudler/LocalAI/pull/2344
models(gallery): add master-yi by @mudler in https://github.com/mudler/LocalAI/pull/2345
models(gallery): update poppy porpoise mmproj by @mudler in https://github.com/mudler/LocalAI/pull/2346
models(gallery): add LocalAI-Llama3-8b-Function-Call-v0.2-GGUF by @mudler in https://github.com/mudler/LocalAI/pull/2355
models(gallery): add stheno by @mudler in https://github.com/mudler/LocalAI/pull/2358
fix(gallery): checksum Meta-Llama-3-70B-Instruct.Q4_K_M.gguf - #2364 by @Nold360 in https://github.com/mudler/LocalAI/pull/2366
models(gallery): add phi-3-medium-4k-instruct by @mudler in https://github.com/mudler/LocalAI/pull/2367
models(gallery): add hercules and helpingAI by @mudler in https://github.com/mudler/LocalAI/pull/2376
ci(checksum_checker): do get sha from hf API when available by @mudler in https://github.com/mudler/LocalAI/pull/2380
models(gallery): ⬆️ update checksum by @localai-bot in https://github.com/mudler/LocalAI/pull/2383
models(gallery): ⬆️ update checksum by @localai-bot in https://github.com/mudler/LocalAI/pull/2386
models(gallery): add aya-35b by @mudler in https://github.com/mudler/LocalAI/pull/2391

📖 Documentation and examples

docs: Update semantic-todo/README.md by @eltociear in https://github.com/mudler/LocalAI/pull/2294
Add Home Assistant Integration by @valentinfrlch in https://github.com/mudler/LocalAI/pull/2387
Add warning for running the binary on MacOS by @mauromorales in https://github.com/mudler/LocalAI/pull/2389

👒 Dependencies

⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2281
⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/2280
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2285
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2290
feat(swagger): update swagger by @localai-bot in https://github.com/mudler/LocalAI/pull/2302
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2303
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2317
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2326
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2316
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2329
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2337
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2339
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2342
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2351
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2352
dependencies(grpcio): bump to fix CI issues by @mudler in https://github.com/mudler/LocalAI/pull/2362
deps(llama.cpp): update and adapt API changes by @mudler in https://github.com/mudler/LocalAI/pull/2381
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2361
⬆️ Update go-skynet/go-bert.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1225
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2360

Other Changes

refactor: Minor improvements to BackendConfigLoader by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2353

New Contributors

@tannisroot made their first contribution in https://github.com/mudler/LocalAI/pull/2330
@lenaxia made their first contribution in https://github.com/mudler/LocalAI/pull/2341
@valentinfrlch made their first contribution in https://github.com/mudler/LocalAI/pull/2387

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.15.0...v2.16.0

LocalAI - v2.15.0

Published by mudler 5 months ago

local-ai-release

🎉 LocalAI v2.15.0! 🚀

Hey awesome people! I'm happy to announce the release of LocalAI version 2.15.0! This update introduces several significant improvements and features, enhancing usability, functionality, and user experience across the board. Dive into the key highlights below, and don't forget to check out the full changelog for more detailed updates.

🌍 WebUI Upgrades: Turbocharged!

🚀 Vision API Integration

The Chat WebUI now seamlessly integrates with the Vision API, making it easier for users to test image processing models directly through the browser interface - this is a very simple and hackable interface in less then 400L of code with Alpine.JS and HTMX!

output

💬 System Prompts in Chat

System prompts can be set in the WebUI chat, which guide the user through interactions more intuitively, making our chat interface smarter and more responsive.

output

🌟 Revamped Welcome Page

New to LocalAI or haven't installed any models yet? No worries! The updated welcome page now guides users through the model installation process, ensuring you're set up and ready to go without any hassle. This is a great first step for newcomers - thanks for your precious feedback!

output

🔄 Background Operations Indicator

Don't get lost with our new background operations indicator on the WebUI, which shows when tasks are running in the background.

output

🔍 Filter Models by Tag and Category

As our model gallery balloons, you can now effortlessly sift through models by tag and category, making finding what you need a breeze.

output

🔧 Single Binary Release

LocalAI is expanding into offering single binary releases, simplifying the deployment process and making it easier to get LocalAI up and running on any system.

For the moment we have condensed the builds which disables AVX and SSE instructions set. We are also planning to include cuda builds as well.

🧠 Expanded Model Gallery

This release introduces several exciting new models to our gallery, such as 'Soliloquy', 'tess', 'moondream2', 'llama3-instruct-coder' and 'aurora', enhancing the diversity and capability of our AI offerings. Our selection of one-click-install models is growing! We pick carefully model from the most trending ones on huggingface, feel free to submit your requests in a github issue, hop to our Discord or contribute by hosting your gallery, or.. even by adding models directly to LocalAI!

local-ai-gallery
local-ai-gallery-new

Want to share your model configurations and customizations? See the docs: https://localai.io/docs/getting-started/customize-model/

📣 Let's Make Some Noise!

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Thanks a ton, and.. enjoy this release!

What's Changed

Bug fixes 🐛

fix(webui): correct documentation URL for text2img by @mudler in https://github.com/mudler/LocalAI/pull/2233
fix(ux): fix small glitches by @mudler in https://github.com/mudler/LocalAI/pull/2265

Exciting New Features 🎉

feat: update ROCM and use smaller image by @cryptk in https://github.com/mudler/LocalAI/pull/2196
feat(llama.cpp): do not specify backends to autoload and add llama.cpp variants by @mudler in https://github.com/mudler/LocalAI/pull/2232
fix(webui): display small navbar with smaller screens by @mudler in https://github.com/mudler/LocalAI/pull/2240
feat(startup): show CPU/GPU information with --debug by @mudler in https://github.com/mudler/LocalAI/pull/2241
feat(single-build): generate single binaries for releases by @mudler in https://github.com/mudler/LocalAI/pull/2246
feat(webui): ux improvements by @mudler in https://github.com/mudler/LocalAI/pull/2247
fix: OpenVINO winograd always disabled by @fakezeta in https://github.com/mudler/LocalAI/pull/2252
UI: flag trust_remote_code to users // favicon support by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2253
feat(ui): prompt for chat, support vision, enhancements by @mudler in https://github.com/mudler/LocalAI/pull/2259

🧠 Models

fix(gallery): hermes-2-pro-llama3 models checksum changed by @Nold360 in https://github.com/mudler/LocalAI/pull/2236
models(gallery): add moondream2 by @mudler in https://github.com/mudler/LocalAI/pull/2237
models(gallery): add llama3-llava by @mudler in https://github.com/mudler/LocalAI/pull/2238
models(gallery): add llama3-instruct-coder by @mudler in https://github.com/mudler/LocalAI/pull/2242
models(gallery): update poppy porpoise by @mudler in https://github.com/mudler/LocalAI/pull/2243
models(gallery): add lumimaid by @mudler in https://github.com/mudler/LocalAI/pull/2244
models(gallery): add openbiollm by @mudler in https://github.com/mudler/LocalAI/pull/2245
gallery: Added some OpenVINO models by @fakezeta in https://github.com/mudler/LocalAI/pull/2249
models(gallery): Add Soliloquy by @mudler in https://github.com/mudler/LocalAI/pull/2260
models(gallery): add tess by @mudler in https://github.com/mudler/LocalAI/pull/2266
models(gallery): add lumimaid variant by @mudler in https://github.com/mudler/LocalAI/pull/2267
models(gallery): add kunocchini by @mudler in https://github.com/mudler/LocalAI/pull/2268
models(gallery): add aurora by @mudler in https://github.com/mudler/LocalAI/pull/2270
models(gallery): add tiamat by @mudler in https://github.com/mudler/LocalAI/pull/2269

📖 Documentation and examples

docs: updated Transformer parameters description by @fakezeta in https://github.com/mudler/LocalAI/pull/2234
Update readme: add ShellOracle to community integrations by @djcopley in https://github.com/mudler/LocalAI/pull/2254
Add missing Homebrew dependencies by @michaelmior in https://github.com/mudler/LocalAI/pull/2256

👒 Dependencies

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/2228
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2229
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2230
build(deps): bump tqdm from 4.65.0 to 4.66.3 in /examples/langchain/langchainpy-localai-example in the pip group across 1 directory by @dependabot in https://github.com/mudler/LocalAI/pull/2231
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2239
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2251
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2255
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2263

Other Changes

test: check the response URL during image gen in app_test.go by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2248

New Contributors

@Nold360 made their first contribution in https://github.com/mudler/LocalAI/pull/2236
@djcopley made their first contribution in https://github.com/mudler/LocalAI/pull/2254
@michaelmior made their first contribution in https://github.com/mudler/LocalAI/pull/2256

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.14.0...v2.15.0

LocalAI - v2.14.0

Published by mudler 6 months ago

🚀 AIO Image Update: llama3 has landed!

We're excited to announce that our AIO image has been upgraded with the latest LLM model, llama3, enhancing our capabilities with more accurate and dynamic responses. Behind the scenes uses https://huggingface.co/NousResearch/Hermes-2-Pro-Llama-3-8B-GGUF which is ready for function call, yay!

💬 WebUI enhancements: Updates in Chat, Image Generation, and TTS

Chat	TTS	Image gen

Our interfaces for Chat, Text-to-Speech (TTS), and Image Generation have finally landed. Enjoy streamlined and simple interactions thanks to the efforts of our team, led by @mudler, who have worked tirelessly to enhance your experience. The WebUI interface serves as a quick way to debug and assess models loaded in LocalAI - there is much to improve, but we have now a small, hackable interface!

🖼️ Many new models in the model gallery!

The model gallery has received a substantial upgrade with numerous new models, including Einstein v6.1, SOVL, and several specialized Llama3 iterations. These additions are designed to cater to a broader range of tasks , making LocalAI more versatile than ever. Kudos to @mudler for spearheading these exciting updates - now you can select with a couple of click the model you like!

🛠️ Robust Fixes and Optimizations

This update brings a series of crucial bug fixes and security enhancements to ensure our platform remains secure and efficient. Special thanks to @dave-gray101, @cryptk, and @fakezeta for their diligent work in rooting out and resolving these issues 🤗

✨ OpenVINO and more

We're introducing OpenVINO acceleration, and many OpenVINO models in the gallery. You can now enjoy fast-as-hell speed on Intel CPU and GPUs. Applause to @fakezeta for the contributions!

📚 Documentation and Dependency Upgrades

We've updated our documentation and dependencies to keep you equipped with the latest tools and knowledge. These updates ensure that LocalAI remains a robust and dependable platform.

👥 A Community Effort

A special shout-out to our new contributors, @QuinnPiers and @LeonSijiaLu, who have enriched our community with their first contributions. Welcome aboard, and thank you for your dedication and fresh insights!

Each update in this release not only enhances our platform's capabilities but also ensures a safer and more user-friendly experience. We are excited to see how our users leverage these new features in their projects, freel free to hit a line on Twitter or in any other social, we'd be happy to hear how you use LocalAI!

📣 Spread the word!

First off, a massive thank you (again!) to each and every one of you who've chipped in to squash bugs and suggest cool new features for LocalAI. Your help, kind words, and brilliant ideas are truly appreciated - more than words can say!

And to those of you who've been heros, giving up your own time to help out fellow users on Discord and in our repo, you're absolutely amazing. We couldn't have asked for a better community.

Just so you know, LocalAI doesn't have the luxury of big corporate sponsors behind it. It's all us, folks. So, if you've found value in what we're building together and want to keep the momentum going, consider showing your support. A little shoutout on your favorite social platforms using @LocalAI_OSS and @mudler_it or joining our sponsors can make a big difference.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Every bit of support, every mention, and every star adds up and helps us keep this ship sailing. Let's keep making LocalAI awesome together!

Thanks a ton, and.. exciting times ahead with LocalAI!

What's Changed

Bug fixes 🐛

fix: config_file_watcher.go - root all file reads for safety by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2144
fix: github bump_docs.sh regex to drop emoji and other text by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2180
fix: undefined symbol: iJIT_NotifyEvent in import torch ##2153 by @fakezeta in https://github.com/mudler/LocalAI/pull/2179
fix: security scanner warning noise: error handlers part 2 by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2145
fix: ensure GNUMake jobserver is passed through to whisper.cpp build by @cryptk in https://github.com/mudler/LocalAI/pull/2187
fix: bring everything onto the same GRPC version to fix tests by @cryptk in https://github.com/mudler/LocalAI/pull/2199

Exciting New Features 🎉

feat(gallery): display job status also during navigation by @mudler in https://github.com/mudler/LocalAI/pull/2151
feat: cleanup Dockerfile and make final image a little smaller by @cryptk in https://github.com/mudler/LocalAI/pull/2146
fix: swap to WHISPER_CUDA per deprecation message from whisper.cpp by @cryptk in https://github.com/mudler/LocalAI/pull/2170
feat: only keep the build artifacts from the grpc build by @cryptk in https://github.com/mudler/LocalAI/pull/2172
feat(gallery): support model deletion by @mudler in https://github.com/mudler/LocalAI/pull/2173
refactor(application): introduce application global state by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2072
feat: organize Dockerfile into distinct sections by @cryptk in https://github.com/mudler/LocalAI/pull/2181
feat: OpenVINO acceleration for embeddings in transformer backend by @fakezeta in https://github.com/mudler/LocalAI/pull/2190
chore: update go-stablediffusion to latest commit with Make jobserver fix by @cryptk in https://github.com/mudler/LocalAI/pull/2197
feat: user defined inference device for CUDA and OpenVINO by @fakezeta in https://github.com/mudler/LocalAI/pull/2212
feat(ux): Add chat, tts, and image-gen pages to the WebUI by @mudler in https://github.com/mudler/LocalAI/pull/2222
feat(aio): switch to llama3-based for LLM by @mudler in https://github.com/mudler/LocalAI/pull/2225
feat(ui): support multilineand style ul by @mudler in https://github.com/mudler/LocalAI/pull/2226

🧠 Models

models(gallery): add Einstein v6.1 by @mudler in https://github.com/mudler/LocalAI/pull/2152
models(gallery): add SOVL by @mudler in https://github.com/mudler/LocalAI/pull/2154
models(gallery): add average_normie by @mudler in https://github.com/mudler/LocalAI/pull/2155
models(gallery): add solana by @mudler in https://github.com/mudler/LocalAI/pull/2157
models(gallery): add poppy porpoise by @mudler in https://github.com/mudler/LocalAI/pull/2158
models(gallery): add Undi95/Llama-3-LewdPlay-8B-evo-GGUF by @mudler in https://github.com/mudler/LocalAI/pull/2160
models(gallery): add biomistral-7b by @mudler in https://github.com/mudler/LocalAI/pull/2161
models(gallery): add llama3-32k by @mudler in https://github.com/mudler/LocalAI/pull/2183
models(gallery): add openvino models by @mudler in https://github.com/mudler/LocalAI/pull/2184
models(gallery): add lexifun by @mudler in https://github.com/mudler/LocalAI/pull/2193
models(gallery): add suzume-llama-3-8B-multilingual-gguf by @mudler in https://github.com/mudler/LocalAI/pull/2194
models(gallery): add guillaumetell by @mudler in https://github.com/mudler/LocalAI/pull/2195
models(gallery): add wizardlm2 by @mudler in https://github.com/mudler/LocalAI/pull/2209
models(gallery): Add Hermes-2-Pro-Llama-3-8B-GGUF by @mudler in https://github.com/mudler/LocalAI/pull/2218

📖 Documentation and examples

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/2149
draft:Update model-gallery.md with correct gallery file by @QuinnPiers in https://github.com/mudler/LocalAI/pull/2163
docs: update gallery, add rerankers by @mudler in https://github.com/mudler/LocalAI/pull/2166
docs: enhance and condense few sections by @mudler in https://github.com/mudler/LocalAI/pull/2178
[Documentations] Removed invalid numberings from troubleshooting mac by @LeonSijiaLu in https://github.com/mudler/LocalAI/pull/2174

👒 Dependencies

⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2150
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2159
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2176
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2177
update go-tinydream to latest commit by @cryptk in https://github.com/mudler/LocalAI/pull/2182
build(deps): bump dependabot/fetch-metadata from 2.0.0 to 2.1.0 by @dependabot in https://github.com/mudler/LocalAI/pull/2186
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2189
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2188
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2203
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2213

Other Changes

Revert "⬆️ Update docs version mudler/LocalAI" by @mudler in https://github.com/mudler/LocalAI/pull/2165
Issue-1720: Updated Build on mac documentations by @LeonSijiaLu in https://github.com/mudler/LocalAI/pull/2171
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2224

New Contributors

@QuinnPiers made their first contribution in https://github.com/mudler/LocalAI/pull/2163
@LeonSijiaLu made their first contribution in https://github.com/mudler/LocalAI/pull/2171

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.13.0...v2.14.0

LocalAI - 🖼️ v2.13.0 - Model gallery edition

Published by mudler 6 months ago

Hello folks, Ettore here - I'm happy to announce the v2.13.0 LocalAI release is out, with many features!

Below there is a small breakdown of the hottest features introduced in this release - however - there are many other improvements (especially from the community) as well, so don't miss out the changelog!

Check out the full changelog below for having an overview of all the changes that went in this release (this one is quite packed up).

🖼️ Model gallery

This is the first release with model gallery in the webUI, you can see now a "Model" button in the WebUI which lands now in a selection of models:

output

You can choose now models between stablediffusion, llama3, tts, embeddings and more! The gallery is growing steadly and being kept up-to-date.

The models are simple YAML files which are hosted in this repository: https://github.com/mudler/LocalAI/tree/master/gallery - you can host your own repository with your model index, or if you want you can contribute to LocalAI.

If you want to contribute adding models, you can by opening up a PR in the gallery directory: https://github.com/mudler/LocalAI/tree/master/gallery.

Rerankers

I'm excited to introduce a new backend for rerankers. LocalAI now implements the Jina API (https://jina.ai/reranker/#apiform) as a compatibility layer, and you can use existing Jina clients and point to those to the LocalAI address. Behind the hoods, uses https://github.com/AnswerDotAI/rerankers.

output

You can test this by using container images with python (this does NOT work with core images) and a model config file like this, or by installing cross-encoder from the gallery in the UI:

name: jina-reranker-v1-base-en
backend: rerankers
parameters:
  model: cross-encoder

and test it with:


    curl http://localhost:8080/v1/rerank \
      -H "Content-Type: application/json" \
      -d '{
      "model": "jina-reranker-v1-base-en",
      "query": "Organic skincare products for sensitive skin",
      "documents": [
        "Eco-friendly kitchenware for modern homes",
        "Biodegradable cleaning supplies for eco-conscious consumers",
        "Organic cotton baby clothes for sensitive skin",
        "Natural organic skincare range for sensitive skin",
        "Tech gadgets for smart homes: 2024 edition",
        "Sustainable gardening tools and compost solutions",
        "Sensitive skin-friendly facial cleansers and toners",
        "Organic food wraps and storage solutions",
        "All-natural pet food for dogs with allergies",
        "Yoga mats made from recycled materials"
      ],
      "top_n": 3
    }'

Parler-tts

There is a new backend available for tts now, parler-tts. It is possible to install and configure the model directly from the gallery. https://github.com/huggingface/parler-tts

🎈 Lot of small improvements behind the scenes!

Thanks to our outstanding community, we have enhanced the performance and stability of LocalAI across various modules. From backend optimizations to front-end adjustments, every tweak helps make LocalAI smoother and more robust.

📣 Spread the word!

And to those of you who've been heros, giving up your own time to help out fellow users on Discord and in our repo, you're absolutely amazing. We couldn't have asked for a better community.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Every bit of support, every mention, and every star adds up and helps us keep this ship sailing. Let's keep making LocalAI awesome together!

Thanks a ton, and here's to more exciting times ahead with LocalAI!

What's Changed

Bug fixes 🐛

fix(autogptq): do not use_triton with qwen-vl by @thiner in https://github.com/mudler/LocalAI/pull/1985
fix: respect concurrency from parent build parameters when building GRPC by @cryptk in https://github.com/mudler/LocalAI/pull/2023
ci: fix release pipeline missing dependencies by @mudler in https://github.com/mudler/LocalAI/pull/2025
fix: remove build path from help text documentation by @cryptk in https://github.com/mudler/LocalAI/pull/2037
fix: previous CLI rework broke debug logging by @cryptk in https://github.com/mudler/LocalAI/pull/2036
fix(fncall): fix regression introduced in #1963 by @mudler in https://github.com/mudler/LocalAI/pull/2048
fix: adjust some sources names to match the naming of their repositories by @cryptk in https://github.com/mudler/LocalAI/pull/2061
fix: move the GRPC cache generation workflow into it's own concurrency group by @cryptk in https://github.com/mudler/LocalAI/pull/2071
fix(llama.cpp): set -1 as default for max tokens by @mudler in https://github.com/mudler/LocalAI/pull/2087
fix(llama.cpp-ggml): fixup max_tokens for old backend by @mudler in https://github.com/mudler/LocalAI/pull/2094
fix missing TrustRemoteCode in OpenVINO model load by @fakezeta in https://github.com/mudler/LocalAI/pull/2114
Incl ocv pkg for diffsusers utils by @jtwolfe in https://github.com/mudler/LocalAI/pull/2115

Exciting New Features 🎉

feat: kong cli refactor fixes #1955 by @cryptk in https://github.com/mudler/LocalAI/pull/1974
feat: add flash-attn in nvidia and rocm envs by @golgeek in https://github.com/mudler/LocalAI/pull/1995
feat: use tokenizer.apply_chat_template() in vLLM by @golgeek in https://github.com/mudler/LocalAI/pull/1990
feat(gallery): support ConfigURLs by @mudler in https://github.com/mudler/LocalAI/pull/2012
fix: dont commit generated files to git by @cryptk in https://github.com/mudler/LocalAI/pull/1993
feat(parler-tts): Add new backend by @mudler in https://github.com/mudler/LocalAI/pull/2027
feat(grpc): return consumed token count and update response accordingly by @mudler in https://github.com/mudler/LocalAI/pull/2035
feat(store): add Golang client by @mudler in https://github.com/mudler/LocalAI/pull/1977
feat(functions): support models with no grammar, add tests by @mudler in https://github.com/mudler/LocalAI/pull/2068
refactor(template): isolate and add tests by @mudler in https://github.com/mudler/LocalAI/pull/2069
feat: fiber logs with zerlog and add trace level by @cryptk in https://github.com/mudler/LocalAI/pull/2082
models(gallery): add gallery by @mudler in https://github.com/mudler/LocalAI/pull/2078
Add tensor_parallel_size setting to vllm setting items by @Taikono-Himazin in https://github.com/mudler/LocalAI/pull/2085
Transformer Backend: Implementing use_tokenizer_template and stop_prompts options by @fakezeta in https://github.com/mudler/LocalAI/pull/2090
feat: Galleries UI by @mudler in https://github.com/mudler/LocalAI/pull/2104
Transformers Backend: max_tokens adherence to OpenAI API by @fakezeta in https://github.com/mudler/LocalAI/pull/2108
Fix cleanup sonarqube findings by @cryptk in https://github.com/mudler/LocalAI/pull/2106
feat(models-ui): minor visual enhancements by @mudler in https://github.com/mudler/LocalAI/pull/2109
fix(gallery): show a fake image if no there is no icon by @mudler in https://github.com/mudler/LocalAI/pull/2111
feat(rerankers): Add new backend, support jina rerankers API by @mudler in https://github.com/mudler/LocalAI/pull/2121

🧠 Models

models(llama3): add llama3 to embedded models by @mudler in https://github.com/mudler/LocalAI/pull/2074
feat(gallery): add llama3, hermes, phi-3, and others by @mudler in https://github.com/mudler/LocalAI/pull/2110
models(gallery): add new models to the gallery by @mudler in https://github.com/mudler/LocalAI/pull/2124
models(gallery): add more models by @mudler in https://github.com/mudler/LocalAI/pull/2129

📖 Documentation and examples

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1988
docs: fix stores link by @adrienbrault in https://github.com/mudler/LocalAI/pull/2044
AMD/ROCm Documentation update + formatting fix by @jtwolfe in https://github.com/mudler/LocalAI/pull/2100

👒 Dependencies

deps: Update version of vLLM to add support of Cohere Command_R model in vLLM inference by @holyCowMp3 in https://github.com/mudler/LocalAI/pull/1975
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1991
build(deps): bump google.golang.org/protobuf from 1.31.0 to 1.33.0 by @dependabot in https://github.com/mudler/LocalAI/pull/1998
build(deps): bump github.com/docker/docker from 20.10.7+incompatible to 24.0.9+incompatible by @dependabot in https://github.com/mudler/LocalAI/pull/1999
build(deps): bump github.com/gofiber/fiber/v2 from 2.52.0 to 2.52.1 by @dependabot in https://github.com/mudler/LocalAI/pull/2001
build(deps): bump actions/checkout from 3 to 4 by @dependabot in https://github.com/mudler/LocalAI/pull/2002
build(deps): bump actions/setup-go from 4 to 5 by @dependabot in https://github.com/mudler/LocalAI/pull/2003
build(deps): bump peter-evans/create-pull-request from 5 to 6 by @dependabot in https://github.com/mudler/LocalAI/pull/2005
build(deps): bump actions/cache from 3 to 4 by @dependabot in https://github.com/mudler/LocalAI/pull/2006
build(deps): bump actions/upload-artifact from 3 to 4 by @dependabot in https://github.com/mudler/LocalAI/pull/2007
build(deps): bump github.com/charmbracelet/glamour from 0.6.0 to 0.7.0 by @dependabot in https://github.com/mudler/LocalAI/pull/2004
build(deps): bump github.com/gofiber/fiber/v2 from 2.52.0 to 2.52.4 by @dependabot in https://github.com/mudler/LocalAI/pull/2008
build(deps): bump github.com/opencontainers/runc from 1.1.5 to 1.1.12 by @dependabot in https://github.com/mudler/LocalAI/pull/2000
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2014
build(deps): bump the pip group across 4 directories with 8 updates by @dependabot in https://github.com/mudler/LocalAI/pull/2017
build(deps): bump follow-redirects from 1.15.2 to 1.15.6 in /examples/langchain/langchainjs-localai-example by @dependabot in https://github.com/mudler/LocalAI/pull/2020
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2024
build(deps): bump softprops/action-gh-release from 1 to 2 by @dependabot in https://github.com/mudler/LocalAI/pull/2039
build(deps): bump dependabot/fetch-metadata from 1.3.4 to 2.0.0 by @dependabot in https://github.com/mudler/LocalAI/pull/2040
build(deps): bump github/codeql-action from 2 to 3 by @dependabot in https://github.com/mudler/LocalAI/pull/2041
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2043
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2042
build(deps): bump the pip group across 4 directories with 8 updates by @dependabot in https://github.com/mudler/LocalAI/pull/2049
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2050
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2060
build(deps): bump aiohttp from 3.9.2 to 3.9.4 in /examples/langchain/langchainpy-localai-example in the pip group across 1 directory by @dependabot in https://github.com/mudler/LocalAI/pull/2067
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2089
deps(llama.cpp): update, use better model for function call tests by @mudler in https://github.com/mudler/LocalAI/pull/2119
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2122
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2123
build(deps): bump pydantic from 1.10.7 to 1.10.13 in /examples/langchain/langchainpy-localai-example in the pip group across 1 directory by @dependabot in https://github.com/mudler/LocalAI/pull/2125
feat(swagger): update swagger by @localai-bot in https://github.com/mudler/LocalAI/pull/2128

Other Changes

ci: try to build on macos14 by @mudler in https://github.com/mudler/LocalAI/pull/2011
⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/2013
refactor: backend/service split, channel-based llm flow by @dave-gray101 in https://github.com/mudler/LocalAI/pull/1963
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2028
fix - correct checkout versions by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2029
Revert "build(deps): bump the pip group across 4 directories with 8 updates" by @mudler in https://github.com/mudler/LocalAI/pull/2030
⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/2032
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2033
fix: action-tmate back to upstream, dead code removal by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2038
Revert #1963 by @mudler in https://github.com/mudler/LocalAI/pull/2056
feat: refactor the dynamic json configs for api_keys and external_backends by @cryptk in https://github.com/mudler/LocalAI/pull/2055
tests: add template tests by @mudler in https://github.com/mudler/LocalAI/pull/2063
feat: better control of GRPC docker cache by @cryptk in https://github.com/mudler/LocalAI/pull/2070
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2051
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/2080
feat: enable polling configs for systems with broken fsnotify (docker volumes on windows) by @cryptk in https://github.com/mudler/LocalAI/pull/2081
fix: action-tmate: use connect-timeout-sections and limit-access-to-actor by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2083
refactor(routes): split routes registration by @mudler in https://github.com/mudler/LocalAI/pull/2077
fix: action-tmate detached by @dave-gray101 in https://github.com/mudler/LocalAI/pull/2092
fix: rename fiber entrypoint from http/api to http/app by @mudler in https://github.com/mudler/LocalAI/pull/2096
fix: typo in models.go by @eltociear in https://github.com/mudler/LocalAI/pull/2099
Update text-generation.md by @Taikono-Himazin in https://github.com/mudler/LocalAI/pull/2095
⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/2105
⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/2113

New Contributors

@holyCowMp3 made their first contribution in https://github.com/mudler/LocalAI/pull/1975
@dependabot made their first contribution in https://github.com/mudler/LocalAI/pull/1998
@adrienbrault made their first contribution in https://github.com/mudler/LocalAI/pull/2044
@Taikono-Himazin made their first contribution in https://github.com/mudler/LocalAI/pull/2085
@eltociear made their first contribution in https://github.com/mudler/LocalAI/pull/2099
@jtwolfe made their first contribution in https://github.com/mudler/LocalAI/pull/2100

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.12.4...V2.13.0

LocalAI - v2.12.4

Published by mudler 6 months ago

Patch release to include https://github.com/mudler/LocalAI/pull/1985

LocalAI - v2.12.3

Published by mudler 6 months ago

I'm happy to announce the v2.12.3 LocalAI release is out!

🌠 Landing page and Swagger

Ever wondered what to do after LocalAI is up and running? Integration with a simple web interface has been started, and you can see now a landing page when hitting the LocalAI front page:

Screenshot from 2024-04-07 14-43-26

You can also now enjoy Swagger to try out the API calls directly:

swagger

🌈 AIO images changes

Now the default model for CPU images is https://huggingface.co/NousResearch/Hermes-2-Pro-Mistral-7B-GGUF - pre-configured for functions and tools API support!
If you are an Intel-GPU owner, the Intel profile for AIO images is now available too!

🚀 OpenVINO and transformers enhancements

Now there is support for OpenVINO and transformers got token streaming support now thanks to @fakezeta!

To try OpenVINO, you can use the example available in the documentation: https://localai.io/features/text-generation/#examples

🎈 Lot of small improvements behind the scenes!

Thanks for our outstanding community, we have enhanced several areas:

The build time of LocalAI was speed up significantly! thanks to @cryptk for the efforts in enhancing the build system
@thiner worked hardly to get Vision support for AutoGPTQ
... and much more! see down below for a full list, be sure to star LocalAI and give it a try!

📣 Spread the word!

And to those of you who've been heros, giving up your own time to help out fellow users on Discord and in our repo, you're absolutely amazing. We couldn't have asked for a better community.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Every bit of support, every mention, and every star adds up and helps us keep this ship sailing. Let's keep making LocalAI awesome together!

Thanks a ton, and here's to more exciting times ahead with LocalAI!

What's Changed

Bug fixes 🐛

fix: downgrade torch by @mudler in https://github.com/mudler/LocalAI/pull/1902
fix(aio): correctly detect intel systems by @mudler in https://github.com/mudler/LocalAI/pull/1931
fix(swagger): do not specify a host by @mudler in https://github.com/mudler/LocalAI/pull/1930
fix(tools): correctly render tools response in templates by @mudler in https://github.com/mudler/LocalAI/pull/1932
fix(grammar): respect JSONmode and grammar from user input by @mudler in https://github.com/mudler/LocalAI/pull/1935
fix(hermes-2-pro-mistral): add stopword for toolcall by @mudler in https://github.com/mudler/LocalAI/pull/1939
fix(functions): respect when selected from string by @mudler in https://github.com/mudler/LocalAI/pull/1940
fix: use exec in entrypoint scripts to fix signal handling by @cryptk in https://github.com/mudler/LocalAI/pull/1943
fix(hermes-2-pro-mistral): correct stopwords by @mudler in https://github.com/mudler/LocalAI/pull/1947
fix(welcome): stable model list by @mudler in https://github.com/mudler/LocalAI/pull/1949
fix(ci): manually tag latest images by @mudler in https://github.com/mudler/LocalAI/pull/1948
fix(seed): generate random seed per-request if -1 is set by @mudler in https://github.com/mudler/LocalAI/pull/1952
fix regression #1971 by @fakezeta in https://github.com/mudler/LocalAI/pull/1972

Exciting New Features 🎉

feat(aio): add intel profile by @mudler in https://github.com/mudler/LocalAI/pull/1901
Enhance autogptq backend to support VL models by @thiner in https://github.com/mudler/LocalAI/pull/1860
feat(assistant): Assistant and AssistantFiles api by @christ66 in https://github.com/mudler/LocalAI/pull/1803
feat: Openvino runtime for transformer backend and streaming support for Openvino and CUDA by @fakezeta in https://github.com/mudler/LocalAI/pull/1892
feat: Token Stream support for Transformer, fix: missing package for OpenVINO by @fakezeta in https://github.com/mudler/LocalAI/pull/1908
feat(welcome): add simple welcome page by @mudler in https://github.com/mudler/LocalAI/pull/1912
fix(build): better CI logging and correct some build failure modes in Makefile by @cryptk in https://github.com/mudler/LocalAI/pull/1899
feat(webui): add partials, show backends associated to models by @mudler in https://github.com/mudler/LocalAI/pull/1922
feat(swagger): Add swagger API doc by @mudler in https://github.com/mudler/LocalAI/pull/1926
feat(build): adjust number of parallel make jobs by @cryptk in https://github.com/mudler/LocalAI/pull/1915
feat(swagger): update by @mudler in https://github.com/mudler/LocalAI/pull/1929
feat: first pass at improving logging by @cryptk in https://github.com/mudler/LocalAI/pull/1956
fix(llama.cpp): set better defaults for llama.cpp by @mudler in https://github.com/mudler/LocalAI/pull/1961

📖 Documentation and examples

docs(aio-usage): update docs to show examples by @mudler in https://github.com/mudler/LocalAI/pull/1921

👒 Dependencies

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1903
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1904
⬆️ Update M0Rf30/go-tiny-dream by @M0Rf30 in https://github.com/mudler/LocalAI/pull/1911
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1913
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1914
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1923
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1924
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1928
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1933
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1934
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1937
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1941
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1953
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1958
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1959
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1964

Other Changes

⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1927
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1960
fix(hermes-2-pro-mistral): correct dashes in template to suppress newlines by @mudler in https://github.com/mudler/LocalAI/pull/1966
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1969
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1970
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1973

New Contributors

@thiner made their first contribution in https://github.com/mudler/LocalAI/pull/1860

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.11.0...v2.12.3

LocalAI - v2.12.1

Published by mudler 6 months ago

I'm happy to announce the v2.12.1 LocalAI release is out!

🌠 Landing page and Swagger

Ever wondered what to do after LocalAI is up and running? Integration with a simple web interface has been started, and you can see now a landing page when hitting the LocalAI front page:

Screenshot from 2024-04-07 14-43-26

You can also now enjoy Swagger to try out the API calls directly:

swagger

🌈 AIO images changes

🚀 OpenVINO and transformers enhancements

Now there is support for OpenVINO and transformers got token streaming support now thanks to @fakezeta!

To try OpenVINO, you can use the example available in the documentation: https://localai.io/features/text-generation/#examples

🎈 Lot of small improvements behind the scenes!

Thanks for our outstanding community, we have enhanced several areas:

The build time of LocalAI was speed up significantly! thanks to @cryptk for the efforts in enhancing the build system
@thiner worked hardly to get Vision support for AutoGPTQ
... and much more! see down below for a full list, be sure to star LocalAI and give it a try!

📣 Spread the word!

And to those of you who've been heros, giving up your own time to help out fellow users on Discord and in our repo, you're absolutely amazing. We couldn't have asked for a better community.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Every bit of support, every mention, and every star adds up and helps us keep this ship sailing. Let's keep making LocalAI awesome together!

Thanks a ton, and here's to more exciting times ahead with LocalAI!

What's Changed

Bug fixes 🐛

fix: downgrade torch by @mudler in https://github.com/mudler/LocalAI/pull/1902
fix(aio): correctly detect intel systems by @mudler in https://github.com/mudler/LocalAI/pull/1931
fix(swagger): do not specify a host by @mudler in https://github.com/mudler/LocalAI/pull/1930
fix(tools): correctly render tools response in templates by @mudler in https://github.com/mudler/LocalAI/pull/1932
fix(grammar): respect JSONmode and grammar from user input by @mudler in https://github.com/mudler/LocalAI/pull/1935
fix(hermes-2-pro-mistral): add stopword for toolcall by @mudler in https://github.com/mudler/LocalAI/pull/1939
fix(functions): respect when selected from string by @mudler in https://github.com/mudler/LocalAI/pull/1940
fix: use exec in entrypoint scripts to fix signal handling by @cryptk in https://github.com/mudler/LocalAI/pull/1943
fix(hermes-2-pro-mistral): correct stopwords by @mudler in https://github.com/mudler/LocalAI/pull/1947
fix(welcome): stable model list by @mudler in https://github.com/mudler/LocalAI/pull/1949
fix(ci): manually tag latest images by @mudler in https://github.com/mudler/LocalAI/pull/1948
fix(seed): generate random seed per-request if -1 is set by @mudler in https://github.com/mudler/LocalAI/pull/1952
fix regression #1971 by @fakezeta in https://github.com/mudler/LocalAI/pull/1972

Exciting New Features 🎉

feat(aio): add intel profile by @mudler in https://github.com/mudler/LocalAI/pull/1901
Enhance autogptq backend to support VL models by @thiner in https://github.com/mudler/LocalAI/pull/1860
feat(assistant): Assistant and AssistantFiles api by @christ66 in https://github.com/mudler/LocalAI/pull/1803
feat: Openvino runtime for transformer backend and streaming support for Openvino and CUDA by @fakezeta in https://github.com/mudler/LocalAI/pull/1892
feat: Token Stream support for Transformer, fix: missing package for OpenVINO by @fakezeta in https://github.com/mudler/LocalAI/pull/1908
feat(welcome): add simple welcome page by @mudler in https://github.com/mudler/LocalAI/pull/1912
fix(build): better CI logging and correct some build failure modes in Makefile by @cryptk in https://github.com/mudler/LocalAI/pull/1899
feat(webui): add partials, show backends associated to models by @mudler in https://github.com/mudler/LocalAI/pull/1922
feat(swagger): Add swagger API doc by @mudler in https://github.com/mudler/LocalAI/pull/1926
feat(build): adjust number of parallel make jobs by @cryptk in https://github.com/mudler/LocalAI/pull/1915
feat(swagger): update by @mudler in https://github.com/mudler/LocalAI/pull/1929
feat: first pass at improving logging by @cryptk in https://github.com/mudler/LocalAI/pull/1956
fix(llama.cpp): set better defaults for llama.cpp by @mudler in https://github.com/mudler/LocalAI/pull/1961

📖 Documentation and examples

docs(aio-usage): update docs to show examples by @mudler in https://github.com/mudler/LocalAI/pull/1921

👒 Dependencies

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1903
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1904
⬆️ Update M0Rf30/go-tiny-dream by @M0Rf30 in https://github.com/mudler/LocalAI/pull/1911
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1913
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1914
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1923
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1924
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1928
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1933
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1934
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1937
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1941
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1953
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1958
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1959
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1964

Other Changes

⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1927
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1960
fix(hermes-2-pro-mistral): correct dashes in template to suppress newlines by @mudler in https://github.com/mudler/LocalAI/pull/1966
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1969
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1970
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1973

New Contributors

@thiner made their first contribution in https://github.com/mudler/LocalAI/pull/1860

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.11.0...v2.12.1

LocalAI - v2.12.0

Published by mudler 6 months ago

I'm happy to announce the v2.12.0 LocalAI release is out!

🌠 Landing page and Swagger

Ever wondered what to do after LocalAI is up and running? Integration with a simple web interface has been started, and you can see now a landing page when hitting the LocalAI front page:

Screenshot from 2024-04-07 14-43-26

You can also now enjoy Swagger to try out the API calls directly:

swagger

🌈 AIO images changes

🚀 OpenVINO and transformers enhancements

Now there is support for OpenVINO and transformers got token streaming support now thanks to @fakezeta!

To try OpenVINO, you can use the example available in the documentation: https://localai.io/features/text-generation/#examples

🎈 Lot of small improvements behind the scenes!

Thanks for our outstanding community, we have enhanced several areas:

The build time of LocalAI was speed up significantly! thanks to @cryptk for the efforts in enhancing the build system
@thiner worked hardly to get Vision support for AutoGPTQ
... and much more! see down below for a full list, be sure to star LocalAI and give it a try!

📣 Spread the word!

And to those of you who've been heros, giving up your own time to help out fellow users on Discord and in our repo, you're absolutely amazing. We couldn't have asked for a better community.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Every bit of support, every mention, and every star adds up and helps us keep this ship sailing. Let's keep making LocalAI awesome together!

Thanks a ton, and here's to more exciting times ahead with LocalAI!

What's Changed

Bug fixes 🐛

fix: downgrade torch by @mudler in https://github.com/mudler/LocalAI/pull/1902
fix(aio): correctly detect intel systems by @mudler in https://github.com/mudler/LocalAI/pull/1931
fix(swagger): do not specify a host by @mudler in https://github.com/mudler/LocalAI/pull/1930
fix(tools): correctly render tools response in templates by @mudler in https://github.com/mudler/LocalAI/pull/1932
fix(grammar): respect JSONmode and grammar from user input by @mudler in https://github.com/mudler/LocalAI/pull/1935
fix(hermes-2-pro-mistral): add stopword for toolcall by @mudler in https://github.com/mudler/LocalAI/pull/1939
fix(functions): respect when selected from string by @mudler in https://github.com/mudler/LocalAI/pull/1940
fix: use exec in entrypoint scripts to fix signal handling by @cryptk in https://github.com/mudler/LocalAI/pull/1943
fix(hermes-2-pro-mistral): correct stopwords by @mudler in https://github.com/mudler/LocalAI/pull/1947
fix(welcome): stable model list by @mudler in https://github.com/mudler/LocalAI/pull/1949
fix(ci): manually tag latest images by @mudler in https://github.com/mudler/LocalAI/pull/1948
fix(seed): generate random seed per-request if -1 is set by @mudler in https://github.com/mudler/LocalAI/pull/1952
fix regression #1971 by @fakezeta in https://github.com/mudler/LocalAI/pull/1972

Exciting New Features 🎉

feat(aio): add intel profile by @mudler in https://github.com/mudler/LocalAI/pull/1901
Enhance autogptq backend to support VL models by @thiner in https://github.com/mudler/LocalAI/pull/1860
feat(assistant): Assistant and AssistantFiles api by @christ66 in https://github.com/mudler/LocalAI/pull/1803
feat: Openvino runtime for transformer backend and streaming support for Openvino and CUDA by @fakezeta in https://github.com/mudler/LocalAI/pull/1892
feat: Token Stream support for Transformer, fix: missing package for OpenVINO by @fakezeta in https://github.com/mudler/LocalAI/pull/1908
feat(welcome): add simple welcome page by @mudler in https://github.com/mudler/LocalAI/pull/1912
fix(build): better CI logging and correct some build failure modes in Makefile by @cryptk in https://github.com/mudler/LocalAI/pull/1899
feat(webui): add partials, show backends associated to models by @mudler in https://github.com/mudler/LocalAI/pull/1922
feat(swagger): Add swagger API doc by @mudler in https://github.com/mudler/LocalAI/pull/1926
feat(build): adjust number of parallel make jobs by @cryptk in https://github.com/mudler/LocalAI/pull/1915
feat(swagger): update by @mudler in https://github.com/mudler/LocalAI/pull/1929
feat: first pass at improving logging by @cryptk in https://github.com/mudler/LocalAI/pull/1956
fix(llama.cpp): set better defaults for llama.cpp by @mudler in https://github.com/mudler/LocalAI/pull/1961

📖 Documentation and examples

docs(aio-usage): update docs to show examples by @mudler in https://github.com/mudler/LocalAI/pull/1921

👒 Dependencies

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1903
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1904
⬆️ Update M0Rf30/go-tiny-dream by @M0Rf30 in https://github.com/mudler/LocalAI/pull/1911
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1913
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1914
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1923
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1924
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1928
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1933
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1934
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1937
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1941
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1953
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1958
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1959
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1964

Other Changes

⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1927
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1960
fix(hermes-2-pro-mistral): correct dashes in template to suppress newlines by @mudler in https://github.com/mudler/LocalAI/pull/1966
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1969
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1970
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1973

New Contributors

@thiner made their first contribution in https://github.com/mudler/LocalAI/pull/1860

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.11.0...v2.12.0

LocalAI - v2.11.0

Published by mudler 7 months ago

Introducing LocalAI v2.11.0: All-in-One Images!

Hey everyone! 🎉 I'm super excited to share what we've been working on at LocalAI - the launch of v2.11.0. This isn't just any update; it's a massive leap forward, making LocalAI easier to use, faster, and more accessible for everyone.

🌠 The Spotlight: All-in-One Images, OpenAI in a box

Imagine having a magic box that, once opened, gives you everything you need to get your AI project off the ground with generative AI. A full clone of OpenAI in a box. That's exactly what our AIO images are! Designed for both CPU and GPU environments, these images come pre-packed with a full suite of models and backends, ready to go right out of the box.

Whether you're using Nvidia, AMD, or Intel, we've got an optimized image for you. If you are using CPU-only you can enjoy even smaller and lighter images.

To start LocalAI, pre-configured with function calling, llm, tts, speech to text, and image generation, just run:

docker run -p 8080:8080 --name local-ai -ti localai/localai:latest-aio-cpu

## Do you have a Nvidia GPUs? Use this instead
## CUDA 11
# docker run -p 8080:8080 --gpus all --name local-ai -ti localai/localai:latest-aio-gpu-cuda-11
## CUDA 12
# docker run -p 8080:8080 --gpus all --name local-ai -ti localai/localai:latest-aio-gpu-cuda-12

❤️ Why You're Going to Love AIO Images:

Ease of Use: Say goodbye to the setup blues. With AIO images, everything is configured upfront, so you can dive straight into the fun part - hacking!
Flexibility: CPU, Nvidia, AMD, Intel? We support them all. These images are made to adapt to your setup, not the other way around.
Speed: Spend less time configuring and more time innovating. Our AIO images are all about getting you across the starting line as fast as possible.

🌈 Jumping In Is a Breeze:

Getting started with AIO images is as simple as pulling from Docker Hub or Quay and running it. We take care of the rest, downloading all necessary models for you. For all the details, including how to customize your setup with environment variables, our updated docs have got you covered here, while you can get more details of the AIO images here.

🎈 Vector Store

Thanks to the great contribution from @richiejp now LocalAI has a new backend type, "vector stores" that allows to use LocalAI as in-memory Vector DB (https://github.com/mudler/LocalAI/issues/1792). You can learn more about it here!

🐛 Bug fixes

This release contains major bugfixes to the watchdog component, and a fix to a regression introduced in v2.10.x which was not respecting --f16, --threads and --context-size to be applied as model's defaults.

🎉 New Model defaults for llama.cpp

Model defaults has changed to automatically offload maximum GPU layers if a GPU is available, and it sets saner defaults to the models to enhance the LLM's output.

🧠 New pre-configured models

You can now run llava-1.6-vicuna, llava-1.6-mistral and hermes-2-pro-mistral, see Run other models for a list of all the pre-configured models available in the release.

📣 Spread the word!

And to those of you who've been heros, giving up your own time to help out fellow users on Discord and in our repo, you're absolutely amazing. We couldn't have asked for a better community.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Every bit of support, every mention, and every star adds up and helps us keep this ship sailing. Let's keep making LocalAI awesome together!

Thanks a ton, and here's to more exciting times ahead with LocalAI!

🔗 Links

Quickstart docs (how to run with AIO images): https://localai.io/basics/getting_started/
More reference on AIO image: https://localai.io/docs/reference/aio-images/
List of embedded models that can be started: https://localai.io/docs/getting-started/run-other-models/

🎁 What's More in v2.11.0?

Bug fixes 🐛

fix(config): pass by config options, respect defaults by @mudler in https://github.com/mudler/LocalAI/pull/1878
fix(watchdog): use ShutdownModel instead of StopModel by @mudler in https://github.com/mudler/LocalAI/pull/1882
NVIDIA GPU detection support for WSL2 environments by @enricoros in https://github.com/mudler/LocalAI/pull/1891
Fix NVIDIA VRAM detection on WSL2 environments by @enricoros in https://github.com/mudler/LocalAI/pull/1894

Exciting New Features 🎉

feat(functions/aio): all-in-one images, function template enhancements by @mudler in https://github.com/mudler/LocalAI/pull/1862
feat(aio): entrypoint, update workflows by @mudler in https://github.com/mudler/LocalAI/pull/1872
feat(aio): add tests, update model definitions by @mudler in https://github.com/mudler/LocalAI/pull/1880
feat(stores): Vector store backend by @richiejp in https://github.com/mudler/LocalAI/pull/1795
ci(aio): publish hipblas and Intel GPU images by @mudler in https://github.com/mudler/LocalAI/pull/1883
ci(aio): add latest tag images by @mudler in https://github.com/mudler/LocalAI/pull/1884

🧠 Models

feat(models): add phi-2-chat, llava-1.6, bakllava, cerbero by @mudler in https://github.com/mudler/LocalAI/pull/1879

📖 Documentation and examples

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1856
docs(mac): improve documentation for mac build by @tauven in https://github.com/mudler/LocalAI/pull/1873
docs(aio): Add All-in-One images docs by @mudler in https://github.com/mudler/LocalAI/pull/1887
fix(aio): make image-gen for GPU functional, update docs by @mudler in https://github.com/mudler/LocalAI/pull/1895

👒 Dependencies

⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1508
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1857
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1864
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1866
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1867
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1874
⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1875
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1881
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1885
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1889

Other Changes

⬆️ Update ggerganov/whisper.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1896
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1897

New Contributors

@enricoros made their first contribution in https://github.com/mudler/LocalAI/pull/1891

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.10.1...v2.11.0

LocalAI - v2.10.1

Published by mudler 7 months ago

What's Changed

Bug fixes 🐛

fix(llama.cpp): fix eos without cache by @mudler in https://github.com/mudler/LocalAI/pull/1852
fix(config): default to debug=false if not set by @mudler in https://github.com/mudler/LocalAI/pull/1853
fix(config-watcher): start only if config-directory exists by @mudler in https://github.com/mudler/LocalAI/pull/1854

Exciting New Features 🎉

deps(whisper.cpp): update, fix cublas build by @mudler in https://github.com/mudler/LocalAI/pull/1846

Other Changes

fixes #1051: handle openai presence and request penalty parameters by @blob42 in https://github.com/mudler/LocalAI/pull/1817
fix(make): allow to parallelize jobs by @cryptk in https://github.com/mudler/LocalAI/pull/1845
fix(go-llama): use llama-cpp as default by @mudler in https://github.com/mudler/LocalAI/pull/1849
⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1847
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1848
test/fix: OSX Test Repair by @dave-gray101 in https://github.com/mudler/LocalAI/pull/1843

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.10.0...v2.10.1

LocalAI - v2.10.0

Published by mudler 7 months ago

LocalAI v2.10.0 Release Notes

Excited to announce the release of LocalAI v2.10.0! This version introduces significant changes, including breaking changes, numerous bug fixes, exciting new features, dependency updates, and more. Here's a summary of what's new:

Breaking Changes 🛠

The trust_remote_code setting in the YAML config file of the model are now consumed for enhanced security measures also for the AutoGPTQ and transformers backend, thanks to @dave-gray101's contribution (#1799). If your model relied on the old behavior and you are sure of what you are doing, set trust_remote_code: true in the YAML config file.

Bug Fixes 🐛

Various fixes have been implemented to enhance the stability and performance of LocalAI:
- SSE no longer omits empty finish_reason fields for better compatibility with the OpenAI API, fixed by @mudler (#1745).
- Functions now correctly handle scenarios with no results, also addressed by @mudler (#1758).
- A Command Injection Vulnerability has been fixed by @ouxs-19 (#1778).
- OpenCL-based builds for llama.cpp have been restored, thanks to @cryptk's efforts (#1828, #1830).
- An issue with OSX build default.metallib has been resolved, which should now allow running the llama-cpp backend on Apple arm64, fixed by @dave-gray101 (#1837).

Exciting New Features 🎉

LocalAI continues to evolve with several new features:
- Ongoing implementation of the assistants API, making great progress thanks to community contributions, including an initial implementation by @christ66 (#1761).
- Addition of diffusers/transformers support for Intel GPU - now you can generate images and use the transformer backend also on Intel GPUs, implemented by @mudler (#1746).
- Introduction of Bitsandbytes quantization for transformer backend enhancement and a fix for transformer backend error on CUDA by @fakezeta (#1823).
- Compatibility layers for Elevenlabs and OpenAI TTS, enhancing text-to-speech capabilities: Now LocalAI is compatible with Elevenlabs and OpenAI TTS, thanks to @mudler (#1834).
- vLLM now supports stream: true! This feature was introduced by @golgeek (#1749).

Dependency Updates 👒

Our continuous effort to keep dependencies up-to-date includes multiple updates to ggerganov/llama.cpp, donomii/go-rwkv.cpp, mudler/go-stable-diffusion, and others, ensuring that LocalAI is built on the latest and most secure libraries.

Other Changes

Several internal changes have been made to improve the development process and documentation, including updates to integration guides, stress reduction on self-hosted runners, and more.

Details of What's Changed

Breaking Changes 🛠

feat(autogpt/transformers): consume trust_remote_code by @dave-gray101 in https://github.com/mudler/LocalAI/pull/1799

Bug fixes 🐛

fix(sse): do not omit empty finish_reason by @mudler in https://github.com/mudler/LocalAI/pull/1745
fix(functions): handle correctly when there are no results by @mudler in https://github.com/mudler/LocalAI/pull/1758
fix(tests): re-enable tests after code move by @mudler in https://github.com/mudler/LocalAI/pull/1764
Fix Command Injection Vulnerability by @ouxs-19 in https://github.com/mudler/LocalAI/pull/1778
fix: the correct BUILD_TYPE for OpenCL is clblas (with no t) by @cryptk in https://github.com/mudler/LocalAI/pull/1828
fix: missing OpenCL libraries from docker containers during clblas docker build by @cryptk in https://github.com/mudler/LocalAI/pull/1830
fix: osx build default.metallib by @dave-gray101 in https://github.com/mudler/LocalAI/pull/1837

Exciting New Features 🎉

fix: vllm - use AsyncLLMEngine to allow true streaming mode by @golgeek in https://github.com/mudler/LocalAI/pull/1749
refactor: move remaining api packages to core by @dave-gray101 in https://github.com/mudler/LocalAI/pull/1731
Bump vLLM version + more options when loading models in vLLM by @golgeek in https://github.com/mudler/LocalAI/pull/1782
feat(assistant): Initial implementation of assistants api by @christ66 in https://github.com/mudler/LocalAI/pull/1761
feat(intel): add diffusers/transformers support by @mudler in https://github.com/mudler/LocalAI/pull/1746
fix(config): set better defaults for inferencing by @mudler in https://github.com/mudler/LocalAI/pull/1822
fix(docker-compose): update docker compose file by @mudler in https://github.com/mudler/LocalAI/pull/1824
feat(model-help): display help text in markdown by @mudler in https://github.com/mudler/LocalAI/pull/1825
feat: Add Bitsandbytes quantization for transformer backend enhancement #1775 and fix: Transformer backend error on CUDA #1774 by @fakezeta in https://github.com/mudler/LocalAI/pull/1823
feat(tts): add Elevenlabs and OpenAI TTS compatibility layer by @mudler in https://github.com/mudler/LocalAI/pull/1834
feat(embeddings): do not require to be configured by @mudler in https://github.com/mudler/LocalAI/pull/1842

👒 Dependencies

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1752
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1753
deps(llama.cpp): update by @mudler in https://github.com/mudler/LocalAI/pull/1759
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1756
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1767
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1772
⬆️ Update donomii/go-rwkv.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1771
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1779
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1789
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1791
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1794
depedencies(sentencentranformers): update dependencies by @TwinFinz in https://github.com/mudler/LocalAI/pull/1797
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1801
⬆️ Update mudler/go-stable-diffusion by @localai-bot in https://github.com/mudler/LocalAI/pull/1802
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1805
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1811
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1827

Other Changes

ci: add stablediffusion to release by @sozercan in https://github.com/mudler/LocalAI/pull/1757
Update integrations.md by @Joshhua5 in https://github.com/mudler/LocalAI/pull/1765
ci: reduce stress on self-hosted runners by @mudler in https://github.com/mudler/LocalAI/pull/1776
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1785
Revert "feat(assistant): Initial implementation of assistants api" by @mudler in https://github.com/mudler/LocalAI/pull/1790
Edit links in readme and integrations page by @lunamidori5 in https://github.com/mudler/LocalAI/pull/1796
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1813
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1816
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1818
fix(doc/examples): set defaults to mirostat by @mudler in https://github.com/mudler/LocalAI/pull/1820
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1821
fix: OSX Build Files for llama.cpp by @dave-gray101 in https://github.com/mudler/LocalAI/pull/1836
⬆️ Update go-skynet/go-llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1835
docs(transformers): add docs section about transformers by @mudler in https://github.com/mudler/LocalAI/pull/1841
⬆️ Update mudler/go-piper by @localai-bot in https://github.com/mudler/LocalAI/pull/1844
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1840

New Contributors

@golgeek made their first contribution in https://github.com/mudler/LocalAI/pull/1749
@Joshhua5 made their first contribution in https://github.com/mudler/LocalAI/pull/1765
@ouxs-19 made their first contribution in https://github.com/mudler/LocalAI/pull/1778
@TwinFinz made their first contribution in https://github.com/mudler/LocalAI/pull/1797
@cryptk made their first contribution in https://github.com/mudler/LocalAI/pull/1828
@fakezeta made their first contribution in https://github.com/mudler/LocalAI/pull/1823

Thank you to all contributors and users for your continued support and feedback, making LocalAI better with each release!

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.9.0...v2.10.0

LocalAI - v2.9.0

Published by mudler 8 months ago

This release brings many enhancements, fixes, and a special thanks to the community for the amazing work and contributions!

We now have sycl images for Intel GPUs, ROCm images for AMD GPUs,and much more:

You can find the AMD GPU images tags between the container images available - look for hipblas. For example, master-hipblas-ffmpeg-core. Thanks to @fenfir for this nice contribution!
Intel GPU images are tagged with sycl. You can find images with two flavors, sycl-f16 and sycl-f32 respectively. For example, master-sycl-f16. Work is in progress to support also diffusers and transformers on Intel GPUs.
Thanks to @christ66 first efforts in supporting the Assistant API were made, and we are planning to support the Assistant API! Stay tuned for more!
Now LocalAI supports the Tools API endpoint - it also supports the (now deprecated) functions API call as usual. We now also have support for SSE with function calling. See https://github.com/mudler/LocalAI/pull/1726 for more
Support for Gemma models - did you hear? Google released OSS models and LocalAI supports it already!
Thanks to @dave-gray101 in https://github.com/mudler/LocalAI/pull/1728 to put efforts in refactoring parts of the code - we are going to support soon more ways to interface with LocalAI, and not only restful api!

Support the project

First off, a massive thank you to each and every one of you who've chipped in to squash bugs and suggest cool new features for LocalAI. Your help, kind words, and brilliant ideas are truly appreciated - more than words can say!

And to those of you who've been heros, giving up your own time to help out fellow users on Discord and in our repo, you're absolutely amazing. We couldn't have asked for a better community.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Every bit of support, every mention, and every star adds up and helps us keep this ship sailing. Let's keep making LocalAI awesome together!

Thanks a ton, and here's to more exciting times ahead with LocalAI! 🚀

What's Changed

Bug fixes 🐛

Add TTS dependency for cuda based builds fixes #1727 by @blob42 in https://github.com/mudler/LocalAI/pull/1730

Exciting New Features 🎉

Build docker container for ROCm by @fenfir in https://github.com/mudler/LocalAI/pull/1595
feat(tools): support Tool calls in the API by @mudler in https://github.com/mudler/LocalAI/pull/1715
Initial implementation of upload files api. by @christ66 in https://github.com/mudler/LocalAI/pull/1703
feat(tools): Parallel function calling by @mudler in https://github.com/mudler/LocalAI/pull/1726
refactor: move part of api packages to core by @dave-gray101 in https://github.com/mudler/LocalAI/pull/1728
deps(llama.cpp): update, support Gemma models by @mudler in https://github.com/mudler/LocalAI/pull/1734

👒 Dependencies

deps(llama.cpp): update by @mudler in https://github.com/mudler/LocalAI/pull/1714
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1740

Other Changes

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1718
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1705
Update README.md by @lunamidori5 in https://github.com/mudler/LocalAI/pull/1739
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1750

New Contributors

@fenfir made their first contribution in https://github.com/mudler/LocalAI/pull/1595
@christ66 made their first contribution in https://github.com/mudler/LocalAI/pull/1703
@blob42 made their first contribution in https://github.com/mudler/LocalAI/pull/1730

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.8.2...v2.9.0

LocalAI - v2.8.2

Published by mudler 8 months ago

What's Changed

Bug fixes 🐛

fix(tts): fix regression when supplying backend from requests by @mudler in https://github.com/mudler/LocalAI/pull/1713

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.8.1...v2.8.2

LocalAI - v2.8.1

Published by mudler 8 months ago

This is a patch release, mostly containing minor patches and bugfixes from 2.8.0.

Most importantly it contains a bugfix for https://github.com/mudler/LocalAI/issues/1333 which made the llama.cpp backend to get stuck in some cases where the model starts to hallucinate and fixes to the python-based backends.

Spread the word!

And to those of you who've been heros, giving up your own time to help out fellow users on Discord and in our repo, you're absolutely amazing. We couldn't have asked for a better community.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Every bit of support, every mention, and every star adds up and helps us keep this ship sailing. Let's keep making LocalAI awesome together!

Thanks a ton, and here's to more exciting times ahead with LocalAI! 🚀

What's Changed

Bug fixes 🐛

fix(vall-e-x): Fix voice cloning by @mudler in https://github.com/mudler/LocalAI/pull/1696
fix: drop unused code by @mudler in https://github.com/mudler/LocalAI/pull/1697
fix(llama.cpp): disable infinite context shifting by @mudler in https://github.com/mudler/LocalAI/pull/1704
fix(llama.cpp): downgrade to a known working version by @mudler in https://github.com/mudler/LocalAI/pull/1706
fix(python): pin exllama2 by @mudler in https://github.com/mudler/LocalAI/pull/1711

Exciting New Features 🎉

feat(tts): respect YAMLs config file, add sycl docs/examples by @mudler in https://github.com/mudler/LocalAI/pull/1692
ci: add cuda builds to release by @sozercan in https://github.com/mudler/LocalAI/pull/1702

Other Changes

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1693
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1694
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1698
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1700

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.8.0...v2.8.1

LocalAI - v2.8.0

Published by mudler 8 months ago

This release adds support for Intel GPUs, and it deprecates old ggml-based backends which are by now superseded by llama.cpp (that now supports more architectures out-of-the-box). See also https://github.com/mudler/LocalAI/issues/1651.

Images are now based on Ubuntu 22.04 LTS instead of Debian bullseye.

Intel GPUs

There are now images tagged with "sycl". There are sycl-f16 and sycl-f32 images indicating f16 or f32 support.

For example, to start phi-2 with an Intel GPU it is enough to use the container image like this:

docker run -e DEBUG=true -ti -v $PWD/models:/build/models -p 8080:8080  -v /dev/dri:/dev/dri --rm quay.io/go-skynet/local-ai:master-sycl-f32-ffmpeg-core phi-2

Note

And to those of you who've been heros, giving up your own time to help out fellow users on Discord and in our repo, you're absolutely amazing. We couldn't have asked for a better community.

Also, if you haven't yet joined our Discord, come on over! Here's the link: https://discord.gg/uJAeKSAGDy

Every bit of support, every mention, and every star adds up and helps us keep this ship sailing. Let's keep making LocalAI awesome, together.

Thanks a ton, and here's to more exciting times ahead with LocalAI! 🚀

What's Changed

Exciting New Features 🎉

feat(sycl): Add support for Intel GPUs with sycl (#1647) by @mudler in https://github.com/mudler/LocalAI/pull/1660
Drop old falcon backend (deprecated) by @mudler in https://github.com/mudler/LocalAI/pull/1675
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1678
Drop ggml-based gpt2 and starcoder (supported by llama.cpp) by @mudler in https://github.com/mudler/LocalAI/pull/1679
fix(Dockerfile): sycl dependencies by @mudler in https://github.com/mudler/LocalAI/pull/1686
feat: Use ubuntu as base for container images, drop deprecated ggml-transformers backends by @mudler in https://github.com/mudler/LocalAI/pull/1689

👒 Dependencies

⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1656
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1665
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1669
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1673
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1683
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1688
⬆️ Update mudler/go-stable-diffusion by @localai-bot in https://github.com/mudler/LocalAI/pull/1674

Other Changes

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1661
feat(mamba): Add bagel-dpo-2.8b by @richiejp in https://github.com/mudler/LocalAI/pull/1671
fix (docs): fixed broken links github/ -> github.com/ by @Wansmer in https://github.com/mudler/LocalAI/pull/1672
Fix HTTP links in README.md by @vfiftyfive in https://github.com/mudler/LocalAI/pull/1677
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1681
ci: cleanup worker before run by @mudler in https://github.com/mudler/LocalAI/pull/1685
Revert "fix(Dockerfile): sycl dependencies" by @mudler in https://github.com/mudler/LocalAI/pull/1687
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1691

New Contributors

@richiejp made their first contribution in https://github.com/mudler/LocalAI/pull/1671
@Wansmer made their first contribution in https://github.com/mudler/LocalAI/pull/1672
@vfiftyfive made their first contribution in https://github.com/mudler/LocalAI/pull/1677

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.7.0...v2.8.0

LocalAI - v2.7.0

Published by mudler 9 months ago

This release adds support to the transformer backend for LLM as well!

For now instance you can run codellama-7b with transformers with:

docker run -ti -p 8080:8080 --gpus all localai/localai:v2.7.0-cublas-cuda12 codellama-7b

In the quickstart there are more examples available https://localai.io/basics/getting_started/#running-models.

Note: As llama.cpp is ongoing with changes that could possible cause breakage, this release does not includes changes from https://github.com/ggerganov/llama.cpp/discussions/5138 (the future versions will).

What's Changed

Bug fixes 🐛

fix(paths): automatically create paths by @mudler in https://github.com/mudler/LocalAI/pull/1650

Exciting New Features 🎉

feat(transformers): support also text generation by @mudler in https://github.com/mudler/LocalAI/pull/1630
transformers: correctly load automodels by @mudler in https://github.com/mudler/LocalAI/pull/1643
feat(startup): fetch model definition remotely by @mudler in https://github.com/mudler/LocalAI/pull/1654

👒 Dependencies

⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1642
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1644
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1652
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1655

Other Changes

⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1632
⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1631

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.6.1...v2.6.2

LocalAI - v2.6.1

Published by mudler 9 months ago

This is a patch release containing bug-fixes around parallel request support with llama.cpp models.

What's Changed

Bug fixes 🐛

fix(llama.cpp): Enable parallel requests by @tauven in https://github.com/mudler/LocalAI/pull/1616
fix(llama.cpp): enable cont batching when parallel is set by @mudler in https://github.com/mudler/LocalAI/pull/1622

Exciting New Features 🎉

feat(grpc): backend SPI pluggable in embedding mode by @coyzeng in https://github.com/mudler/LocalAI/pull/1621

👒 Dependencies

⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1623

Other Changes

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1619
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1620
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1626

New Contributors

@tauven made their first contribution in https://github.com/mudler/LocalAI/pull/1616
@coyzeng made their first contribution in https://github.com/mudler/LocalAI/pull/1621

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.6.0...v2.6.1

LocalAI - v2.6.0

Published by mudler 9 months ago

What's Changed

Bug fixes 🐛

move BUILD_GRPC_FOR_BACKEND_LLAMA logic to makefile: errors in this section now immediately fail the build by @dionysius in https://github.com/mudler/LocalAI/pull/1576
prepend built binaries in PATH for BUILD_GRPC_FOR_BACKEND_LLAMA by @dionysius in https://github.com/mudler/LocalAI/pull/1593

Exciting New Features 🎉

minor: replace shell pwd in Makefile with CURDIR for better windows compatibility by @dionysius in https://github.com/mudler/LocalAI/pull/1571
Makefile: allow to build without GRPC_BACKENDS by @mudler in https://github.com/mudler/LocalAI/pull/1607
feat: 🐍 add mamba support by @mudler in https://github.com/mudler/LocalAI/pull/1589
feat(extra-backends): Improvements, adding mamba example by @mudler in https://github.com/mudler/LocalAI/pull/1618

👒 Dependencies

⬆️ Update docs version mudler/LocalAI by @localai-bot in https://github.com/mudler/LocalAI/pull/1567
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1568
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1573
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1578
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1583
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1587
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1590
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1594
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1599

Other Changes

Moving the how tos to self hosted by @lunamidori5 in https://github.com/mudler/LocalAI/pull/1574
docs: missing golang requirement for local build for debian by @dionysius in https://github.com/mudler/LocalAI/pull/1596
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1597
docs/examples: enhancements by @mudler in https://github.com/mudler/LocalAI/pull/1572
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1604
Update README.md by @lunamidori5 in https://github.com/mudler/LocalAI/pull/1601
docs: re-use original permalinks by @mudler in https://github.com/mudler/LocalAI/pull/1610
⬆️ Update ggerganov/llama.cpp by @localai-bot in https://github.com/mudler/LocalAI/pull/1612
Expanded and interlinked Docker documentation by @jamesbraza in https://github.com/mudler/LocalAI/pull/1614
Modernized LlamaIndex integration by @jamesbraza in https://github.com/mudler/LocalAI/pull/1613

New Contributors

@dionysius made their first contribution in https://github.com/mudler/LocalAI/pull/1571

Full Changelog: https://github.com/mudler/LocalAI/compare/v2.5.1...v2.6.0

Package Rankings

Top 5.37% on Proxy.golang.org

Badges

Extracted from project README