Interactive, in-browser visualization of how a transformer language model works: tokens, attention, quantization, and sampling, rendered live.