Before a model sees any text, that text is split into tokens. This is a simplified byte-pair-style merge that shows the core idea behind real tokenizers, such as the ones GPT and Claude use.
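As a rough illustration of the merge idea, here is a minimal sketch in Python. It makes several simplifying assumptions: it starts from individual characters rather than bytes, it learns merges from the single input string instead of a training corpus, and the function names (`most_frequent_pair`, `merge_pair`, `toy_bpe`) are hypothetical, not taken from any real tokenizer library.

```python
from collections import Counter


def most_frequent_pair(tokens):
    """Return the most common adjacent pair of tokens, or None if there are no pairs."""
    pairs = Counter(zip(tokens, tokens[1:]))
    if not pairs:
        return None
    return pairs.most_common(1)[0][0]


def merge_pair(tokens, pair):
    """Replace each non-overlapping occurrence of `pair` (left to right) with one merged token."""
    merged = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
            merged.append(tokens[i] + tokens[i + 1])
            i += 2  # skip both halves of the merged pair
        else:
            merged.append(tokens[i])
            i += 1
    return merged


def toy_bpe(text, num_merges):
    """Start from single characters and greedily apply up to `num_merges` merges.

    Returns the final token list and the list of merged pairs, in the order
    they were chosen. This is a toy: real tokenizers learn their merges once
    from a large corpus and then reuse them on new text.
    """
    tokens = list(text)
    merges = []
    for _ in range(num_merges):
        pair = most_frequent_pair(tokens)
        if pair is None:  # fewer than two tokens left, nothing to merge
            break
        tokens = merge_pair(tokens, pair)
        merges.append(pair)
    return tokens, merges


if __name__ == "__main__":
    tokens, merges = toy_bpe("aaabdaaabac", num_merges=2)
    print(tokens)
    print(merges)
```

Each merge shortens the token sequence without losing information: joining the tokens back together always reproduces the original string. Frequent character runs end up as single tokens, while rare ones stay split into smaller pieces.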