prompt.fail

prompt.fail explores prompt injection techniques in large language models (LLMs), providing examples to improve LLM security and robustness.

GPL-3.0 License

Stars

6

Committers

View Code on GitHub Visit Website

Ecosystems: Llama

prompt.fail

Welcome to prompt.fail, a project dedicated to exploring and documenting techniques for prompt injection in large language models (LLMs). Our mission is to enhance the security and robustness of LLMs by identifying and understanding how malicious prompts can manipulate these models. By sharing and analyzing these techniques, we aim to build a community that contributes to the development of more resilient AI systems.

Table of Contents

🔓 What is Prompt Injection?

Prompt injection is a critical area of study in the field of AI safety and security. It involves crafting specific inputs (prompts) that can cause large language models to behave in unintended or harmful ways. Understanding these vulnerabilities is essential for improving the design and implementation of future AI systems.

OWASP Top 10 for Large Language Model Applications

You can find the prompt injection techniques in the first position of the OWASP Top 10 for Large Language Model Applications. The OWASP Top 10 for Large Language Model Applications is a list of the most critical security risks to be aware of when working with large language models (LLMs). OWASP says: "Manipulating LLMs via crafted inputs can lead to unauthorized access, data breaches, and compromised decision-making."

Why is Prompt Injection Important?

Prompt injection can lead to a wide range of security risks, including:

Data Leakage: Malicious prompts can cause LLMs to reveal sensitive information.
Bias Amplification: Biased prompts can reinforce or amplify existing biases in the model.
Adversarial Attacks: Attackers can manipulate LLMs to generate harmful or misleading content.
Privacy Violations: Prompts can be used to extract personal data or violate user privacy.

This repository is a collaborative effort to document various prompt injection techniques. We encourage contributions from the community to help expand our knowledge base and share insights on how to mitigate these risks.

📝 Examples

🚧 Work in progress here... 🚧

✍️ Contributing

We highly appreciate contributions from the community. Here’s how you can contribute:

Option 1: Open an Issue

If you have an idea for a new prompt injection technique, idea, or question, feel free to open an issue. We welcome all feedback and suggestions.

Option 2: Submit a Pull Request

If you would like to contribute with code or documentation, you can submit a pull request. Here’s how you can do it:

Fork the repository.
Create a new branch (Example: feature/your-feature).
Commit your changes (Please, use conventional commits conventions).
Push to the branch
Open a Pull Request.

Let’s work together to make prompt.fail a valuable resource for the Cybersecurity & AI community!

📜 License

This project is licensed under the GPL-3.0 license. For more information, please refer to the LICENSE file.

Related Projects

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

16 Mar 2023 1,086

prompt-guard

A Streamlit app for testing Prompt Guard, a classifier model by Meta for detecting prompt attacks.

PurpleLlama

Set of tools to assess and improve LLM security.

06 Dec 2023 2,227

llm_server

Rack API application for Llama.cpp

Promptimizer

An Automated AI-Powered Prompt Optimization Framework

26 Jul 2024 134

open-strawberry

Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI,...

llm-prompt-templates

Empower your LLM to do more than you ever thought possible with these state-of-the-art prompt tem...

Get-Things-Done-with-Prompt-Engineering-and-LangChain

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with cus...

12 Apr 2023 1,094

SyntaxShaper

Powering Agent Chains by Constraining LLM Outputs

llama-guard-prompt-utils

Prompt utilities for llama-guard. Use MLCommons taxonomies or build your own safety categories.

HackBot

AI-powered cybersecurity chatbot designed to provide helpful and accurate answers to your cyberse...

30 Jul 2023 259

prompt_injection

Prompt injection techniques

llm-chain

`llm-chain` is a powerful rust crate for building chains in large language models allowing you to...

24 Mar 2023 1,322

LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

24 Jul 2023 755

pyllamacpp

Python bindings for llama.cpp