LLMOps

Deploy and scale Large Language Models (LLMs) in production.

LLMOps - Language Model Operations

Overview

This repository contains two Python examples for fine-tuning, deploying, and scaling language models using Modal, LangChain, FastAPI, vLLM, and Hugging Face Transformers.

Author

Prince Canuma - An MLOps Engineer and founder of Kulissiwa. Previously, he worked as an ML Engineer at neptune.ai. He is passionate about MLOps, deep learning, and software engineering.

Contributions

Contributions to this project are welcome. Please follow the standard procedures for submitting issues and pull requests.

License

This project is licensed under the MIT License - see the LICENSE file for details.