A Security Testing Suite for Large Language Models
LLM-Security is my personal security benchmark for testing large language models. It contains a collection of exploits that attempt to make models behave maliciously.
Warning
This collection is intended for educational purposes only. Do NOT use it for illegal activities.
To get started with LLM-Security, follow these steps:
- Clone the repository
git clone https://github.com/nnxmms/LLM-Security.git
- Change directory to the downloaded repository
cd LLM-Security
- Create a virtual environment
virtualenv -p python3.11 env
- Activate the environment
source ./env/bin/activate
- Install requirements
pip install -r requirements.txt
- Register at OpenAI to obtain an OPENAI_API_KEY
- Create a .env file and update the values
cp .env.example .env
Now you can run the benchmark with the following command
python3 benchmark.py
The results will be stored in a benchmark.json file.
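Once the run has finished, benchmark.json can be inspected like any other JSON document. The snippet below is a minimal sketch for loading and pretty-printing it; it is not part of the repository, and the exact schema of the results is defined by the benchmark itself and not assumed here.

```python
# Minimal sketch: load and pretty-print the benchmark results.
# Assumes benchmark.json contains valid JSON; its exact schema is
# defined by benchmark.py and is not assumed here.
import json

with open("benchmark.json", "r", encoding="utf-8") as f:
    results = json.load(f)

print(json.dumps(results, indent=2, ensure_ascii=False))
```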
Example .env configuration for OpenAI:
# General
VENDOR=openai
MODELNAME=gpt-3.5-turbo
# OpenAI
OPENAI_API_KEY=sk-...
Example .env configuration for Ollama:
# General
VENDOR=ollama
MODELNAME=llama3:instruct
# OpenAI
OPENAI_API_KEY=
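When using the Ollama configuration, the model must be available locally before running the benchmark; with a standard Ollama installation this is typically done with `ollama pull llama3:instruct` (assuming Ollama is installed and its server is running). Whichever vendor you choose, the values in `.env` are read at runtime. The sketch below shows a common way such a file is consumed with python-dotenv; it illustrates the pattern only, is not the repository's actual loading code, and the helper name `load_settings` is hypothetical.

```python
# Illustrative sketch (hypothetical): read VENDOR, MODELNAME and
# OPENAI_API_KEY from a .env file using python-dotenv.
# The actual benchmark may load its configuration differently.
import os

from dotenv import load_dotenv  # pip install python-dotenv


def load_settings() -> dict:
    # Hypothetical helper: loads .env from the current directory and
    # returns the three values shown in the examples above.
    load_dotenv()
    return {
        "vendor": os.getenv("VENDOR", "openai"),
        "modelname": os.getenv("MODELNAME", "gpt-3.5-turbo"),
        "openai_api_key": os.getenv("OPENAI_API_KEY", ""),
    }


if __name__ == "__main__":
    settings = load_settings()
    # Avoid printing the API key itself.
    print(f"vendor={settings['vendor']} model={settings['modelname']}")
```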
This table provides an overview of all exploits that are used within this benchmark.
| Paper | Link |
|---|---|
| Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation | Hacking and Security - Persona Modulation |
| ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs | Hacking and Security - ArtPrompt |