newsReddit r/LocalLLaMATrust 58 · CommunityPublished 3d agoLive · 3d ago
ascend-tribe/openPangu-2.0-Flash (They haven't uploaded it to Huggingface yet)
https://ai.gitcode.com/ascend-tribe/openPangu-2.0-Flash openPangu-2.0-Flash is an MoE model trained on Ascend. The model has 92B total parameters and 6B activated parameters. Its context length is 512k. The total pretraining data contains 34T tokens. During Post-training, openPangu-2.0-Flash is trained through unified SFT with slow and fast thinking capability, multiple specialist RL tr
