Can Agents Generalize to the Open World? Unveiling the Fragility of Static Training in Tool Use
While Large Language Model (LLM) agents demonstrate proficiency in static benchmarks, their deployment in real-world scenarios is hindered by the dynamic nature of user queries, tool sets, and interaction dynamics. To address this generalization gap, we formalize OpenAgent (Tool-Use Agent in Open-World), a problem setting characterized by distributional shifts across query, action, observation, and domain dimensions. To systematically diagnose its impact, we construct a controlled sandbox environment where we define fine-grained environmental shifts across a four-tier hierarchy, Perception, In
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Why these links exist
- Linked via arxiv authorSong-Lin Lv →
Can Agents Generalize to the Open World? Unveiling the Fragility of Static Training in Tool Use
- Linked via arxiv authorWeiming Wu →
Can Agents Generalize to the Open World? Unveiling the Fragility of Static Training in Tool Use
- Linked via arxiv authorRui Zhu →
Can Agents Generalize to the Open World? Unveiling the Fragility of Static Training in Tool Use
- Linked via arxiv authorZi-Jian Cheng →
Can Agents Generalize to the Open World? Unveiling the Fragility of Static Training in Tool Use
- Linked via arxiv authorLan-Zhe Guo →
Can Agents Generalize to the Open World? Unveiling the Fragility of Static Training in Tool Use
