Paper · arXiv · Trust 82 · Primary · Published 4d ago · Live · 3d ago

DAIN: Dynamic Agent-Based Interaction Network for Efficient and Collaborative Multimodal Reasoning

Current multimodal fusion approaches, particularly those based on static Mixture-of-Experts (MoE) architectures, often struggle to provide the adaptive and efficient collaborative reasoning required by complex real-world applications. We introduce the Dynamic Agent-based Interaction Network (DAIN), which reconceptualizes multimodal fusion as a dynamic, multi-agent collaborative process. DAIN employs a context-aware Meta-Controller that dynamically schedules sparse activation of specialized interaction agents and orchestrates compressed inter-agent communication for consensus-building. The fram
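To make the idea of a controller that sparsely activates agents and fuses their compressed messages more concrete, here is a minimal sketch in plain Python. It is not the paper's implementation: the dot-product scoring, top-k selection, average-pool compression, and softmax-weighted consensus are all illustrative assumptions, and every name (`schedule_agents`, `compress`, `consensus`, the toy agents and keys) is hypothetical.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def softmax(scores: Sequence[float]) -> List[float]:
    # Numerically stable softmax over a short list of scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def schedule_agents(context: Vector, agent_keys: List[Vector], k: int) -> List[Tuple[int, float]]:
    # Hypothetical meta-controller: score each agent against the context
    # and keep only the top-k (sparse activation), with softmax weights.
    scores = [dot(context, key) for key in agent_keys]
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    weights = softmax([scores[i] for i in top])
    return list(zip(top, weights))


def compress(message: Vector, width: int) -> Vector:
    # Toy compression (assumption): average-pool the message into `width` chunks.
    # Assumes len(message) is divisible by `width`.
    chunk = len(message) // width
    return [sum(message[i * chunk:(i + 1) * chunk]) / chunk for i in range(width)]


def consensus(context: Vector,
              agents: List[Callable[[Vector], Vector]],
              agent_keys: List[Vector],
              k: int = 2,
              width: int = 2) -> Tuple[Vector, List[int]]:
    # Run only the scheduled agents, compress their outputs, and
    # combine them with the controller's weights.
    active = schedule_agents(context, agent_keys, k)
    fused = [0.0] * width
    for idx, weight in active:
        msg = compress(agents[idx](context), width)
        fused = [f + weight * m for f, m in zip(fused, msg)]
    return fused, [idx for idx, _ in active]


# Three toy "interaction agents" and the keys the controller matches against.
agents = [
    lambda x: [2 * v for v in x],
    lambda x: [v + 1 for v in x],
    lambda x: [-v for v in x],
]
agent_keys = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]]
context = [0.9, 0.1, 0.5, 0.0]

fused, active = consensus(context, agents, agent_keys, k=2, width=2)
```

With this context only two of the three agents run, and each contributes a two-value message instead of its full four-value output. That is the kind of saving sparse scheduling and compressed communication aim for, though the real system presumably learns these components rather than hard-coding them.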
