Topic

Turboquant

1 items across the graph — tagged with Turboquant.

From the graph · 1

Extreme weight + KV cache compression for LLMs on Apple Silicon (MLX implementation of Google's TurboQuant)