Ollama LangChain Guide

Develop LangChain using local LLMs with Ollama

  • LLM costs getting you down?
  • Want to develop offline?
  • Don't want to share your personal data with LLM providers?
  • Save costs, develop anywhere, and own all your data with Ollama and LangChain!

Before you start

  • This tutorial requires several terminals to be open and running processes at once, e.g. to keep the Ollama server running while you work in another terminal.
  • When you see the 🆕 emoji before a set of terminal commands, open a new terminal process.
  • When you see the ♻️ emoji before a set of terminal commands, you can re-use the same terminal you used last time.

Prerequisites

  1. Download and install Ollama and start the server.

🆕

curl -fsSL https://ollama.com/install.sh | sh
ollama serve
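
Once installed, the server listens on http://localhost:11434. As a quick sanity check from Python (a minimal sketch using only the standard library; at the time of writing the root endpoint replies with a short status banner):

import urllib.request

# Query the Ollama server's root endpoint to confirm it is running.
with urllib.request.urlopen("http://localhost:11434") as response:
    print(response.read().decode())  # e.g. "Ollama is running"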

  2. Download and install Poetry.

  3. Fork this repository and set up the Poetry environment:

🆕

git clone https://github.com/Cutwell/ollama-langchain-guide.git
cd ollama-langchain-guide
poetry install

Tutorial

  1. Browse the available Ollama models and select a model.
  • Think about your local computer's available RAM and GPU memory when picking the model and quantisation level.
  • We will be using the phi-2 model from Microsoft (Ollama, Hugging Face) as it is both small and fast.
  • Read this summary for advice on prompting the phi-2 model optimally (see the prompt-format sketch after the pull command below).

♻️

ollama pull phi
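
As a sketch of the instruct format mentioned above (the "Instruct: ... Output:" template follows the phi-2 model card; the question text is just an example):

from langchain.prompts import PromptTemplate

# phi-2 answers QA-style prompts best when wrapped in its
# "Instruct: ... Output:" format.
template = PromptTemplate.from_template("Instruct: {question}\nOutput:")
print(template.format(question="What is the capital of France?"))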

  2. Start the Ollama server.
  • This server can be queried with LangChain (see the sketch after the run command below), or you can interact with it directly in this terminal.

♻️

ollama run phi
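
This minimal sketch shows a LangChain query against the local server (assuming the langchain-community package; older LangChain versions expose the same class as langchain.llms.Ollama):

from langchain_community.llms import Ollama

# Connects to the default Ollama endpoint at http://localhost:11434.
llm = Ollama(model="phi")
print(llm.invoke("Explain recursion in one sentence."))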

  3. Run the pytest tests in /ollama_langchain_guide/tests to check everything is working correctly.

🆕

poetry run pytest -rP ollama_langchain_guide/tests
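
If a test fails, check that the Ollama server is still running. As a hypothetical illustration (not the repository's actual tests), a smoke test in this style can be as simple as:

from langchain_community.llms import Ollama

def test_phi_responds():
    # The model should return a non-empty string for a trivial prompt.
    llm = Ollama(model="phi")
    answer = llm.invoke("Say hello.")
    assert isinstance(answer, str) and len(answer) > 0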

  4. Get started building your own local LLM projects with the example Streamlit app in /ollama_langchain_guide/src.

♻️

poetry run streamlit run ollama_langchain_guide/src/app.py --server.port=8080
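
The example app wires phi-2 into a simple web UI. A stripped-down sketch of the same idea (hypothetical, not the repository's actual app.py):

import streamlit as st
from langchain_community.llms import Ollama

llm = Ollama(model="phi")

st.title("Local phi-2 chat")
question = st.text_input("Ask a question:")
if question:
    # Each submitted question is sent to the local Ollama server.
    st.write(llm.invoke(question))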

Pros and Cons of Phi-2

| Pros | Cons |
| --- | --- |
| Natural language, human-like outputs. | Can distract itself; prone to creating logic puzzles based on user queries and then trying to solve them itself. |
| Context window of 2048 tokens, so it can use chat history in answers. | Often ignores established facts in chat history; answers the same question multiple ways in the same conversation. |
| Can output syntax-correct Python code. | Bad at generating code that achieves the desired goal, e.g. outputs a syntax-correct function to calculate Pi, but its outputs are garbage. |
| Very fast response time. | |

License

MIT