repoGitHubTrust 82 · PrimaryPublished 4h agoLive · 59m ago

peremartra/Rearchitecting-LLMs

Official code for the Manning book on structural LLM optimization: depth/width pruning, knowledge distillation, and attention optimization, runnable on free Colab GPUs.

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Related to

tutorialEvaluate a model properly

Covers

newsI shrank a transformer until every number fitted on the screen and made the weights editable [R]newsThe gap between open weights LLMs and closed source LLMs newsHow're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]

Covers (incoming)

newsH64LM: A 249M-parameter Mixture-of-Experts Transformer built from scratch in PyTorch [P]

Related across the graph

newsThe gap between open weights LLMs and closed source LLMs newsHow're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]newsI shrank a transformer until every number fitted on the screen and made the weights editable [R]newsH64LM: A 249M-parameter Mixture-of-Experts Transformer built from scratch in PyTorch [P]tutorialEvaluate a model properly

Topics