Moe
2 items across the graph — tagged with Moe.
From the graph · 2
model
zai-org/GLM-5.2
→repoHugging Face model with 3342 likes. Tags: transformers, safetensors, glm_moe_dsa, text-generation, conversational, en, zh, arxiv:2602.15763, arxiv:2603.12201, l…
Enping-Hu/minimind-deep-dive
→逐行对照 MiniMind 源码精读、并延伸到大模型技术体系的中文学习笔记 —— 预训练 / SFT / DPO / PPO / GRPO、训练机制、MiniMind2→3 版本对照、真实实验证据。
