Read original ↗
paperarXivTrust 82 · PrimaryPublished yesterdayLive · 21h ago

Automated grading of Linux/bash examinations using large language models: a four-level cognitive taxonomy approach

Scalable and reliable grading of command-line examinations remains a challenge in computing education, where rising enrolments make manual marking difficult and rule-based autograders cannot handle partial credit, equivalent solutions, or syntactic variation. This paper evaluates whether four frontier Large Language Models (GPT, Claude Opus, Gemini, and GLM) can approximate expert judgment when grading short Linux/bash command responses. The study adopts a four-level cognitive taxonomy that combines cognitive complexity and operational impact, ranging from information retrieval (L1) and basic

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

  • Linked via arxiv authorManuel Alonso-Carracedo

    Automated grading of Linux/bash examinations using large language models: a four-level cognitive taxonomy approach

  • Linked via arxiv authorRuben Fernandez-Boullon

    Automated grading of Linux/bash examinations using large language models: a four-level cognitive taxonomy approach

  • Linked via arxiv authorPedro Celard

    Automated grading of Linux/bash examinations using large language models: a four-level cognitive taxonomy approach

  • Automated grading of Linux/bash examinations using large language models: a four-level cognitive taxonomy approach

  • Linked via arxiv authorLorena Otero-Cerdeira

    Automated grading of Linux/bash examinations using large language models: a four-level cognitive taxonomy approach

authored (incoming)

Related across the graph

Topics