Read original ↗
newsHacker NewsTrust 72 · CommunityPublished 6d agoLive · 6d ago

DSpark: Speculative decoding accelerates LLM inference [pdf]

717points294comments

Covers

Covers (incoming)

Related across the graph