prompt-guard

Prompt Guard is a classifier model by Meta, trained on a large corpus of attacks, capable of detecting both explicitly malicious prompts (jailbreaks) as well as data that contains injected inputs (prompt injections). Upon analysis, it returns one or more of the following verdicts, along with a confidence score for each:

INJECTION
JAILBREAK
BENIGN

This repository contains a Streamlit app for testing Prompt Guard. Note that you'll need an HuggingFace access token to access the model.

Here's a sample response by Prompt Guard upon detecting a prompt injection attempt.

Here's a sample response by Prompt Guard upon detecting a jailbreak attempt.

Related Projects

ComfyUI-N-Nodes

A suite of custom nodes for ConfyUI that includes GPT text-prompt generation, LoadVideo, SaveVide...

06 Aug 2023 201

nanogptjs

Interact with NanoGPT's API for pay-per-prompt interaction with AI models

19 Aug 2024 4

llama-guard-prompt-utils

Prompt utilities for llama-guard. Use MLCommons taxonomies or build your own safety categories.

08 Aug 2024 0

HackBot

AI-powered cybersecurity chatbot designed to provide helpful and accurate answers to your cyberse...

30 Jul 2023 259

PurpleLlama

Set of tools to assess and improve LLM security.

06 Dec 2023 2,227

prompt.fail

prompt.fail explores prompt injection techniques in large language models (LLMs), providing examp...

06 Jul 2024 6

llama.cpp-ts

llama.cpp 🦙 LLM inference in TypeScript

27 Jul 2024 0

Get-Things-Done-with-Prompt-Engineering-and-LangChain

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with cus...

12 Apr 2023 1,094

genaiscript

Generative AI Scripting for VSCode

17 Aug 2023 56

dynamic_prompting

Dynamic Few-Shot Prompting is a Python package that dynamically selects N samples that are contex...

03 Jul 2024 3

CandyLLM

A simple, easy-to-use framework for HuggingFace and OpenAI text-generation models.

09 Jul 2024 2

prompt_injection

Prompt injection techniques

16 Aug 2024 1

plock

From anywhere you can type, query and stream the output of an LLM or any other script

21 Jan 2024 446

AI-Prompts

⭐ A fast and lightweight web application that lists useful and favorite AI/GPT prompts ⭐

12 Aug 2024 3

cappr

Completion After Prompt Probability. Make your LLM make a choice

22 Feb 2023 68