Llm Training
5 items across the graph — tagged with Llm Training.
From the graph · 5
repo
rllm-org/rllm
→repoDemocratizing Reinforcement Learning for LLMs
utkuozdemir/nvidia_gpu_exporter
→repoNvidia GPU exporter for prometheus using nvidia-smi binary
chrisliu298/awesome-on-policy-distillation
→repoA curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models
Enping-Hu/minimind-deep-dive
→repo逐行对照 MiniMind 源码精读、并延伸到大模型技术体系的中文学习笔记 —— 预训练 / SFT / DPO / PPO / GRPO、训练机制、MiniMind2→3 版本对照、真实实验证据。
R-D-BioTech-Alaska/Qelm
→Qelm - Quantum Enhanced Language Model
