Angestrom
Search
Papers
Models
Live AI
Intelligence
Search
⌕
Go
⌘K
More
▾
Enterprise
Pricing
Sign in
≡
Home
/
Topics
/
inference
Topic cluster · 1 items
inference
paper
Speculative decoding with draft models
Accelerating generation by drafting tokens with a small model.
Related topics
efficiency (1)
✦