promptfoo
by promptfoo
Promptfoo is a developer-friendly tool for testing and securing large language model (LLM) applications through automated evaluations and red teaming.
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
Primary Use Case
This tool is primarily used by developers and security professionals to evaluate the performance and security of AI applications powered by LLMs. It enables users to conduct vulnerability scans and compare different models to ensure the reliability and safety of AI systems.
- Automated evaluations for prompts and models
- Red teaming and vulnerability scanning for LLM applications
- Side-by-side comparison of various AI models
- CI/CD integration for automated checks
- Local execution ensuring privacy of prompts
- Support for multiple LLM APIs and programming languages
- Generation of security vulnerability reports
Installation
- Install and initialize project using: npx promptfoo@latest init
Usage
>_ npx promptfoo evalRuns your first evaluation of prompts and models.
- Can be chained with Metasploit for automated exploitation
- Useful for continuous security monitoring in CI/CD pipelines
- Encourage collaboration between developers and security teams for effective red teaming
- Integrate with existing security tools to enhance vulnerability scanning capabilities
- Leverage the tool's reporting features to improve security awareness across the organization
Docs Take 2 Hours. AI Takes 10 Seconds.
Ask anything about promptfoo. Installation? Config? Troubleshooting? Get answers trained on real docs and GitHub issues—not generic ChatGPT fluff.
3 free chats per tool • Instant responses • No credit card
Related Tools
PayloadsAllTheThings
swisskyrepo/PayloadsAllTheThings
A list of useful payloads and bypass for Web Application Security and Pentest/CTF

hoppscotch
hoppscotch/hoppscotch
Open source API development ecosystem - https://hoppscotch.io (open-source alternative to Postman, Insomnia)
ImHex
WerWolv/ImHex
🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.

termux-app
termux/termux-app
Termux - a terminal emulator application for Android OS extendible by variety of packages.

sentry
getsentry/sentry
Developer-first error tracking and performance monitoring
CheatSheetSeries
OWASP/CheatSheetSeries
The OWASP Cheat Sheet Series was created to provide a concise collection of high value information on specific application security topics.
