paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago

ManimAgent: Self-Evolving Multimodal Agents for Visual Education

Multi-round reflection lets agents built on large language models recover from failures within a single task, but each task remains an isolated episode: lessons learned across many reflection rounds on one task are discarded before the next begins. We study this gap on a code-generation task: from a scientific paper section, the agent writes Python in the open-source Manim library to render a mathematical animation. We present ManimAgent, a self-evolving multimodal agent that carries reflection experience across tasks through a dual-channel Episodic Memory Bank grown entirely from its own task

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

newsCheck out real-life AI prototypes from the Futures Lab.newsAI coding agents taught robots how to install GPUs and cut zip ties

Covers (incoming)

newsVibe Coding / Agentic workflow

Implements (incoming)

repoai-collection/ai-collection repoEvolvingLMMs-Lab/LLaVA-OneVision-2 repojmerelnyc/Photo-agents repoJosephOIbrahim/Comfy-Cozy

Related across the graph

newsAI coding agents taught robots how to install GPUs and cut zip ties newsCheck out real-life AI prototypes from the Futures Lab.repoEvolvingLMMs-Lab/LLaVA-OneVision-2 repoai-collection/ai-collection newsVibe Coding / Agentic workflow repojmerelnyc/Photo-agents repoJosephOIbrahim/Comfy-Cozy

Topics

cs.AI