repo · GitHub · Trust 82 · Primary · Published 4h ago · Live · 59m ago
peremartra/Rearchitecting-LLMs
Official code for the Manning book on structural LLM optimization: depth/width pruning, knowledge distillation, and attention optimization, runnable on free Colab GPUs.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Related across the graph
news: The gap between open weights LLMs and closed source LLMs
news: How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]
news: I shrank a transformer until every number fitted on the screen and made the weights editable [R]
news: H64LM: A 249M-parameter Mixture-of-Experts Transformer built from scratch in PyTorch [P]
tutorial: Evaluate a model properly
