repoGitHubTrust 82 · PrimaryPublished yesterdayLive · 16h ago
promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Related to
Covers
Implements
Related across the graph
newsPrompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routerstoolPromptLabpaperSWE-Doctor: Guiding Software Engineering Agents with Runtime Diagnosis from Multi-Faceted Bug Reproduction TestsnewsA system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]toolAgentTrace
