repoGitHubTrust 82 · PrimaryPublished yesterdayLive · yesterday
chrisliu298/awesome-on-policy-distillation
A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Implements
Covers
Implements (incoming)
Related across the graph
paperMOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-TrainingnewsKnowledge Distillation of Black-Box Large Language ModelspaperDemoPSD: Disagreement-Modulated Policy Self-DistillationpaperPurified OPSD: On-Policy Self-Distillation Without Losing How to ThinkpaperPHF: Privileged Hidden Flow for On-Policy Self-DistillationpaperDistill to Detect: Exposing Stealth Biases in LLMs through Cartridge DistillationnewsKnowledge Distillation of Black-Box Large Language Models (2024)paperNeuron-Aware Data Selection for Annotation-Free LLM Self-Distillation
