Hacker News · Trust 72 · Community · Published 5d ago

Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch

Hi everyone, I started working on nanoeuler after the ban of Anthropic's fable, because my ambition and dream is to work in the AI field at Anthropic. Two things led me to create nanoeuler: (1) interfacing with LLMs does not mean understanding how they are built, and (2) I wanted to work on LLMs at a very low level to understand the correlation between parameters, data, and the growth of the model, how the GPU works, and how some layers can be optimized. So I st
