repoGitHubTrust 82 · PrimaryPublished 13h agoLive · 12h ago
guoqingbao/xinfer
Blazing-fast LLM inference in pure Rust. No PyTorch and Python runtime.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Covers
newsDSpark: Speculative decoding accelerates LLM inference [pdf]newsProfiling in PyTorch (Part 2): From nn.Linear to a Fused MLPnewsOpenAI and Broadcom announce chip designed for LLM inference at scalenewsWould having a dedicated programming language specifically for LLMs be a viable solution? [D]newsDeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]
Related across the graph
newsProfiling in PyTorch (Part 2): From nn.Linear to a Fused MLPnewsOpenAI and Broadcom announce chip designed for LLM inference at scalenewsWould having a dedicated programming language specifically for LLMs be a viable solution? [D]newsDSpark: Speculative decoding accelerates LLM inference [pdf]newsDeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]
