paperarXivTrust 82 · PrimaryPublished yesterdayLive · 7h ago

Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation

Post-training large language models (LLMs) without real-world interaction feedback or human-labeled supervision remains challenging, particularly in specialized domains where expert annotations are costly to obtain. Recent annotation-free self-evolution methods address this by using the model's own outputs as supervision signals, constructing a teacher via additional context and aggregating predictions across multiple rollouts through majority voting to produce pseudo-labels. However, these approaches are not without drawbacks: SFT- and GRPO-based variants suffer out-of-domain performance degr

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

Linked via arxiv authorZhuowei Chen →
Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation
Linked via arxiv authorXiang Lorraine Li →
Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation

Implements

repochrisliu298/awesome-llm-unlearning reponick7nlp/Awesome-LLM-On-Policy-Distillation repochrisliu298/awesome-on-policy-distillation

Covers

newsIEEE Rolls Out Large Language Models Virtual Training Course

authored (incoming)

personZhuowei Chen personXiang Lorraine Li

Related across the graph

repochrisliu298/awesome-llm-unlearning repochrisliu298/awesome-on-policy-distillation reponick7nlp/Awesome-LLM-On-Policy-Distillation personZhuowei Chen newsIEEE Rolls Out Large Language Models Virtual Training Course personXiang Lorraine Li

Topics

cs.AI