paperarXivTrust 82 · PrimaryPublished 5d agoLive · 3d ago

PolicyGuard: A Dialogue-Grounded Sub-Agent Verifier for Policy Adherence in LLM Agents

LLM agents handle user requests on behalf of organizations through tool calls and must follow the company policies stated in their system prompts. Prior work approaches this as a safeguarding problem -- external checks that block non-compliant agent actions. We argue that policy adherence is a broader problem: real workflows unfold across many turns, require explicit user confirmation and prerequisite reads, and hinge on the content of the dialogue rather than on any single argument value. Meeting this bar requires (i) full conversation context, (ii) self-reasoning over the policy and the curr

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Implements

repoagent-tools

Covers

newsPrompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routers newsProduction-grade AI agents for financial compliance: Lessons from Stripe

Covers (incoming)

newsA system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]

Related across the graph

newsPrompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routers newsA system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]newsProduction-grade AI agents for financial compliance: Lessons from Stripe repoagent-tools

Topics

cs.AI