person profile

Gianmarco Spinaci

Gianmarco Spinaci — researcher or builder tracked in the Angestrom contributor network.

3Connections

1Papers

0Models

0Repos

0News

Papers · 1

EduArt: An educational-level benchmark for evaluating art history knowledge in large language models

Large language models now score near ceiling on general benchmarks, but these aggregate measures reveal little about how models behave within single disciplines. Existing art-focused evaluations rely on synthetic questions and rarely report item-level properties. This paper introduces EduArt, an educational-level benchmark for art-historical knowledge and visual reasoning in multimodal LLMs. EduArt comprises 871 human-authored questions from Italian secondary-school exercises and US Advanced Placement Art History exams, spanning two languages and seven formats from multiple choice to in-text w