repoGitHubTrust 82 · PrimaryPublished 15h agoLive · 13h ago
manjunathshiva/turboquant-mlx
Extreme weight + KV cache compression for LLMs on Apple Silicon (MLX implementation of Google's TurboQuant)
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
