Read original ↗
paperarXivTrust 82 · PrimaryPublished yesterdayLive · 7h ago

Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation

Post-training large language models (LLMs) without real-world interaction feedback or human-labeled supervision remains challenging, particularly in specialized domains where expert annotations are costly to obtain. Recent annotation-free self-evolution methods address this by using the model's own outputs as supervision signals, constructing a teacher via additional context and aggregating predictions across multiple rollouts through majority voting to produce pseudo-labels. However, these approaches are not without drawbacks: SFT- and GRPO-based variants suffer out-of-domain performance degr

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

  • Linked via arxiv authorZhuowei Chen

    Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation

  • Linked via arxiv authorXiang Lorraine Li

    Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation

Implements

Covers

authored (incoming)

Related across the graph

Topics