Topic

Speed

5 items across the graph — tagged with Speed.

From the graph · 5

LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

TokenSpeed is a speed-of-light LLM inference engine.

A PyTorch native library for training speculative decoding models