paperarXivTrust 82 · PrimaryPublished 2d agoLive · yesterday
MemSyco-Bench: Benchmarking Sycophancy in Agent Memory
Memory has emerged as a cornerstone of modern LLM-based agents, supporting their evolution from single-turn assistants to long-term collaborators. However, memory is not always beneficial: retrieved memories often induce a critical issue of sycophancy, causing agents to over-align with the user at the cost of factual accuracy or objective reasoning. Despite this emerging risk, existing memory benchmarks primarily evaluate whether memories are correctly stored, retrieved, or updated, while overlooking how retrieved memories influence downstream reasoning and decision-making. To bridge this gap,
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Implements
Covers
Covers (incoming)
Implements (incoming)
repoplur-ai/plurrepomem0ai/mem0repoOpen-Curiosity/gini-agentrepojoshuaswarren/remnicrepoactiveloopai/hivemindreporajkripal/cashewrepoMemPalace/mempalacerepoTeleAI-UAGI/Awesome-Agent-Memoryrepothedotmack/claude-memrepobasicmachines-co/basic-memoryrepoplastic-labs/honchorepoGuyMannDude/mnemo-cortexrepoAIPMAndy/dna-memoryrepomirkofr/FERNmerepoRexodus/anamnesion-memory-serverrepoEverMind-AI/RavenrepoMemTensor/MemOSreposyncable-dev/memtrace-publicreponambok/mentedbrepogowtham0992/link
Related across the graph
repoAIPMAndy/dna-memoryrepobasicmachines-co/basic-memorynewsI built an open-source memory governance layer for AI assistants - looking for technical feedback [P]repomem0ai/mem0repoactiveloopai/hivemindrepoGuyMannDude/mnemo-cortexrepogowtham0992/linkrepoplastic-labs/honchorepojoshuaswarren/remnicreporajkripal/cashewrepomirkofr/FERNmenewsStructured memory filtering with metadata in AgentCore MemorynewsEvaluating long-term memory limits in stateless LLM chatbots — feedback needed [D]repoRexodus/anamnesion-memory-serverreposyncable-dev/memtrace-publicrepoEverMind-AI/Ravenrepothedotmack/claude-memrepoOpen-Curiosity/gini-agentreponambok/mentedbrepolas7/memharnessrepoTeleAI-UAGI/Awesome-Agent-MemoryrepoMemTensor/MemOSrepoMemPalace/mempalacerepoplur-ai/plurrepoNoshkoto/Noshyrepoagent-tools
