Read original ↗newsReddit r/LocalLLaMATrust 58 · CommunityPublished 8d agoLive · 7d ago[Research] JetSpec: Speculative Decoding with Parallel Tree Drafting Enables up to 9.64x Lossless LLM Inference Speedup with more than 1000TPS…✦Explain this simplyOpen SourceReddit r/LocalLLaMACoverspaperSpeculative decoding with draft modelsCovers (incoming)paperDepth Exploration for LLM DecodingpaperBlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decodingreposgl-project/SpecForgeRelated across the graphpaperBlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decodingreposgl-project/SpecForgepaperDepth Exploration for LLM DecodingpaperSpeculative decoding with draft modelsKnowledge path·PBlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding→Rsgl-project/SpecForge→PDepth Exploration for LLM Decoding→N[Research] JetSpec: Speculative Decoding with Parallel Tree Drafting Enables up to 9.64x Lossless LLM Inference Speedup with more than 1000TPS⧉↗ share