Repo · GitHub · Trust 82 · Primary · Published 2d ago · Live · 2d ago
Emmimal/prompt-regression-suite
Detect prompt regressions before they reach production — per-category accuracy scoring, deterministic validation, and False Improvement detection. Pure Python, zero dependencies.
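As a rough illustration of the idea in the description, the sketch below scores two prompt versions per category and flags the case where overall accuracy rises while at least one category gets worse. This is not the repository's API: the function names, the `(category, passed)` input shape, the `tolerance` parameter, and the reading of "False Improvement" as "overall gain that hides a per-category regression" are all assumptions made for this example.

```python
from collections import defaultdict


def per_category_accuracy(results):
    """Map each category to its pass rate.

    `results` is a list of (category, passed) pairs; this input shape is
    an assumption for the sketch, not the repository's format.
    """
    totals = defaultdict(int)
    passes = defaultdict(int)
    for category, passed in results:
        totals[category] += 1
        passes[category] += int(passed)
    return {c: passes[c] / totals[c] for c in totals}


def overall_accuracy(results):
    """Fraction of all cases that passed, ignoring categories."""
    return sum(int(passed) for _, passed in results) / len(results)


def find_false_improvement(baseline, candidate, tolerance=0.0):
    """Compare two runs and report categories that regressed.

    Hypothetical helper: a "false improvement" here means the overall
    score went up while some category dropped by more than `tolerance`.
    """
    base_cat = per_category_accuracy(baseline)
    cand_cat = per_category_accuracy(candidate)
    regressed = {
        c: (base_cat[c], cand_cat[c])
        for c in base_cat
        if c in cand_cat and cand_cat[c] < base_cat[c] - tolerance
    }
    improved = overall_accuracy(candidate) > overall_accuracy(baseline)
    return {
        "overall_improved": improved,
        "regressed": regressed,
        "false_improvement": improved and bool(regressed),
    }


# Made-up example data: the new prompt fixes "billing" but breaks "refunds".
baseline = [
    ("billing", True), ("billing", True), ("billing", False), ("billing", False),
    ("refunds", True), ("refunds", True),
]
candidate = [
    ("billing", True), ("billing", True), ("billing", True), ("billing", True),
    ("refunds", True), ("refunds", False),
]

report = find_false_improvement(baseline, candidate)
print(report)
```

In this made-up data the aggregate score moves from 4/6 to 5/6, so a single overall number would look like progress, while the per-category view shows "refunds" falling from 1.0 to 0.5. The sketch uses only the standard library, in keeping with the zero-dependency claim.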
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
