Read original ↗
paperarXivTrust 82 · PrimaryPublished 2d agoLive · yesterday

MemSyco-Bench: Benchmarking Sycophancy in Agent Memory

Memory has emerged as a cornerstone of modern LLM-based agents, supporting their evolution from single-turn assistants to long-term collaborators. However, memory is not always beneficial: retrieved memories often induce a critical issue of sycophancy, causing agents to over-align with the user at the cost of factual accuracy or objective reasoning. Despite this emerging risk, existing memory benchmarks primarily evaluate whether memories are correctly stored, retrieved, or updated, while overlooking how retrieved memories influence downstream reasoning and decision-making. To bridge this gap,

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Implements

Covers

Covers (incoming)

Implements (incoming)

Related across the graph

Topics