Read original ↗
paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago

ManimAgent: Self-Evolving Multimodal Agents for Visual Education

Multi-round reflection lets agents built on large language models recover from failures within a single task, but each task remains an isolated episode: lessons learned across many reflection rounds on one task are discarded before the next begins. We study this gap on a code-generation task: from a scientific paper section, the agent writes Python in the open-source Manim library to render a mathematical animation. We present ManimAgent, a self-evolving multimodal agent that carries reflection experience across tasks through a dual-channel Episodic Memory Bank grown entirely from its own task

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

Covers (incoming)

Implements (incoming)

Related across the graph

Topics