The forward and reverse processes, with runnable code.
Step through attention, MLPs, and training on a toy task.