paperarXivTrust 82 · PrimaryPublished 5d agoLive · 3d ago
PolicyGuard: A Dialogue-Grounded Sub-Agent Verifier for Policy Adherence in LLM Agents
LLM agents handle user requests on behalf of organizations through tool calls and must follow the company policies stated in their system prompts. Prior work approaches this as a safeguarding problem -- external checks that block non-compliant agent actions. We argue that policy adherence is a broader problem: real workflows unfold across many turns, require explicit user confirmation and prerequisite reads, and hinge on the content of the dialogue rather than on any single argument value. Meeting this bar requires (i) full conversation context, (ii) self-reasoning over the policy and the curr
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Implements
Covers
Covers (incoming)
Related across the graph
newsPrompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routersnewsA system-level approach to prompt injection: separating instruction and data channels in LLM agents [P]newsProduction-grade AI agents for financial compliance: Lessons from Striperepoagent-tools
