repo · GitHub
engineering87/llm-atlas
Interactive, in-browser visualization of how a transformer language model works: tokens, attention, quantization, and sampling, rendered live.