repo · GitHub · Trust 82 · Primary · Published 19h ago · Live · 15h ago
alibaba/rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
Covers
news · OpenAI and Broadcom announce chip designed for LLM inference at scale
news · OpenAI and Broadcom unveil LLM-optimized inference chip
news · Hardware startup unveils inference accelerator
news · DSpark: Speculative decoding accelerates LLM inference [pdf]
news · DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]
Related across the graph
news · OpenAI and Broadcom announce chip designed for LLM inference at scale
news · OpenAI and Broadcom unveil LLM-optimized inference chip
news · DSpark: Speculative decoding accelerates LLM inference [pdf]
news · DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]
news · Hardware startup unveils inference accelerator
