Topic

Llm Training

5 items across the graph — tagged with Llm Training.

From the graph · 5

Democratizing Reinforcement Learning for LLMs

Nvidia GPU exporter for prometheus using nvidia-smi binary

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models

逐行对照 MiniMind 源码精读、并延伸到大模型技术体系的中文学习笔记 —— 预训练 / SFT / DPO / PPO / GRPO、训练机制、MiniMind2→3 版本对照、真实实验证据。

Qelm - Quantum Enhanced Language Model