Skip to content

A curated list of MLSecOps tools, articles and other resources on security applied to Machine Learning and MLOps systems.

License

Notifications You must be signed in to change notification settings

RiccardoBiosas/awesome-MLSecOps

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 

Repository files navigation

Awesome Maintenance GitHub GitHub

Awesome MLSecOps

A curated list of awesome open-source tools, resources, and tutorials for MLSecOps (Machine Learning Security Operations).

mm.png

Table of Contents

Open Source Security Tools

  • ModelScan - Protection Against ML Model Serialization Attacks.
  • NB Defense - Secure Jupyter Notebooks.
  • Garak - LLM vulnerability scanner.
  • Adversarial Robustness Toolbox - A library of defense methods for machine learning models against adversarial attacks.
  • MLSploit - MLsploit is a cloud framework for interactive experimentation with adversarial machine learning research.
  • TensorFlow Privacy - A library of privacy-preserving machine learning algorithms and tools.
  • Foolbox - A Python toolbox for creating and evaluating adversarial attacks and defenses.
  • Advertorch - A Python toolbox for adversarial robustness research.
  • Artificial Intelligence Threat Matrix - A framework for identifying and mitigating threats to machine learning systems.
  • Adversarial ML Threat Matrix - Adversarial Threat Landscape for AI Systems.
  • CleverHans - A library of adversarial examples and defenses for machine learning models.
  • AdvBox - Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle、PyTorch、Caffe2、MxNet、Keras、TensorFlow.
  • Audit AI - Bias Testing for Generalized Machine Learning Applications.
  • Deep Pwning - Deep-pwning is a lightweight framework for experimenting with machine learning models with the goal of evaluating their robustness against a motivated adversary.
  • Privacy Meter - An open-source library to audit data privacy in statistical and machine learning algorithms.
  • TensorFlow Model Analysis - A library for analyzing, validating, and monitoring machine learning models in production.
  • PromptInject - A framework that assembles adversarial prompts.
  • TextAttack - TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in NLP.
  • OpenAttack - An Open-Source Package for Textual Adversarial Attack.
  • TextFooler - A Model for Natural Language Attack on Text Classification and Inference.
  • Flawed Machine Learning Security - Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the machine learning model lifecycle from training, to packaging, to deployment.
  • Adversarial Machine Learning CTF - This repository is a CTF challenge, showing a security flaw in most (all?) common artificial neural networks. They are vulnerable for adversarial images.
  • Damn Vulnerable LLM Project - A Large Language Model designed for getting hacked
  • Gandalf Lakera - Prompt Injection CTF playground
  • Vigil - LLM prompt injection and security scanner
  • PALLMs (Payloads for Attacking Large Language Models) - list of various payloads for attacking LLMs collected in one place
  • AI-exploits - exploits for MlOps systems. It's not just in the inputs given to LLMs such as ChatGPT
  • Offensive ML Playbook - Offensive ML Playbook. Notes on machine learning attacks and pentesting.
  • AnonLLM - Anonymize Personally Identifiable Information (PII) for Large Language Model APIs.
  • AI Goat - vulnerable LLM CTF challenges.
  • Pyrit - The Python Risk Identification Tool for generative AI.
  • Raze to the Ground: Query-Efficient Adversarial HTML Attacks on Machine-Learning Phishing Webpage Detectors - Source code of the paper "Raze to the Ground: Query-Efficient Adversarial HTML Attacks on Machine-Learning Phishing Webpage Detectors" accepted at AISec '23
  • Giskard - Open-source testing tool for LLM applications
  • Safetensors - Convert pickle to a safe serialization option
  • Citadel Lens- Quality testing of models according to industry standards
  • Model-Inversion-Attack-ToolBox - A framework for implementing Model Inversion attacks
  • NeMo-Guardials - NeMo Guardrails allow developers building LLM-based applications to easily add programmable guardrails between the application code and the LLM.
  • AugLy - A tool for generating adversarial attacks.
  • Knockoffnets - PoC to implement BlackBox attacks to steal model data.
  • Robust Intelligence Continous Validation - Tool for continuous model validation for compliance with standards
  • VGER - Jupyter Attack framework
  • AIShield Watchtower - An open source tool from AIShield for studying AI models and scanning for vulnerabilities.

Commercial

DATA

ML code security

  • lintML - security linter for ML, by Nvidia
  • HiddenLayer: Model as Code - research about some vectors in ml libraries.
  • Copycat CNN - proof-of-concept on how to generate a copy of a Convolutional Neural Network by querying it as a black-box with random data and using the output to train a copycat CNN which mimics the target CNN's predictive patterns.
  • differential-privacy-library - It is a library designed for differential privacy and machine learning. Its goal is to allow experimentation, simulation and implementation of differentially private models.

101

Do you want to learn AI Security but don't know where to start ? Take a look at this map:

map.png

full size in this repository:

Attack Vectors

Blogs and Publications

MlOps infrastracture vulnerabilities

Community Resources

Infographics

MlSecOps lifecycle

MLSecOps_Lifecycle

Ai Security Market map

Market Map

Contributions

All contributions to this list are welcome!

Contributors ✨