
news · Hacker News · Trust 52 · Community · Published 4d ago · Live · 4d ago

Knowledge Distillation of Black-Box Large Language Models

48 points · 12 comments

Open Source · Hacker News · verified · also covered by: Hacker News

Covers

paper: How Surprising Is Historical Italian to Language Models? Tokenization Tax, Comprehension Tax, and a Simple Mitigation
paper: NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models
glossary_term: Transformer
repo: engineering87/llm-atlas
repo: minimal-diffusion-lm

Covers (incoming)

paper: When are likely answers right? On Sequence Probability and Correctness in LLMs
paper: Scaling limit of the Random Language Model
paper: AB-RAG: Adaptive Budgeted Retrieval-Augmented Generation for Reliable Question Answering
paper: Little Brains, Big Feats: Exploring Compact Language Models
paper: Efficient Retrieval-Augmented Generation via Token Co-occurrence Graphs
paper: Grounding LLM Reasoning under Incomplete Graph Evidence
paper: The Model Organism Lottery: Model Organism Interpretability Strongly Depends on Training Methodology
repo: chrisliu298/awesome-on-policy-distillation
repo: chrisliu298/awesome-llm-unlearning
paper: Object Aligner: A Configurable JSON Schema Similarity Score for Graphs, Applied to LLM Prompt Optimization
paper: EduArt: An educational-level benchmark for evaluating art history knowledge in large language models
repo: nick7nlp/Awesome-LLM-On-Policy-Distillation

Related across the graph

repo: chrisliu298/awesome-llm-unlearning
paper: Little Brains, Big Feats: Exploring Compact Language Models
paper: Object Aligner: A Configurable JSON Schema Similarity Score for Graphs, Applied to LLM Prompt Optimization
repo: chrisliu298/awesome-on-policy-distillation
paper: When are likely answers right? On Sequence Probability and Correctness in LLMs
glossary_term: Transformer
repo: minimal-diffusion-lm
repo: engineering87/llm-atlas
paper: AB-RAG: Adaptive Budgeted Retrieval-Augmented Generation for Reliable Question Answering
paper: Grounding LLM Reasoning under Incomplete Graph Evidence
repo: nick7nlp/Awesome-LLM-On-Policy-Distillation
paper: Efficient Retrieval-Augmented Generation via Token Co-occurrence Graphs
paper: EduArt: An educational-level benchmark for evaluating art history knowledge in large language models
paper: How Surprising Is Historical Italian to Language Models? Tokenization Tax, Comprehension Tax, and a Simple Mitigation
paper: The Model Organism Lottery: Model Organism Interpretability Strongly Depends on Training Methodology
paper: NuclearQAv2: A Structured Benchmark for Evaluating Domain-Science Competence in Large Language Models
paper: Scaling limit of the Random Language Model