Read original ↗
paperarXivTrust 82 · PrimaryPublished 3d agoLive · 2d ago

Self-Study Reconsidered: The Hidden Fragility of Learning from Self-Generated QA

Language models are increasingly taught from synthetic question--answer (QA) supervision: a model generates questions about a document, answers them from the same text, and the resulting pairs are used to fine-tune, distill, or compress knowledge into another model. We show that this generation step is not neutral preprocessing. It is an implicit policy that both selects which evidence becomes training signal and decides how that evidence is answered, and it is fragile at both stages. When choosing what to ask, generators do not scan a document uniformly. Coverage saturates early and concentra

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

Related across the graph

Topics