paper · arXiv · Trust 82 · Primary · Published 3d ago · Live · 2d ago

Modality-Driven Search with Holistic Trace Judging for ARC-AGI-2

Large language models can produce fluent, internally coherent reasoning traces for abstract reasoning tasks while still being confidently wrong, which makes selection among candidates, not just generation, the central challenge. I present a solver for ARC-AGI-2, a few-shot visual reasoning benchmark, built around two principles: (i) treating reasoning modalities as search operators, generating diverse candidates independently across text, image, and code channels, and (ii) context-preserving holistic judging, in which a judge model jointly compares all candidate reasoning traces within a single lo
