Skip to main content

Search Papers Models Live AI Intelligence

Search⌕⌘K

Enterprise Pricing Sign in

Stay Ahead in the AI Revolution

Weekly digest — EPI pulse, top intelligence, fresh lineage. Free, no account.

Follow Angestrom

Global source network

Synced every 5 minutes

Continuous sync from primary AI sources — indexed, enriched, and queryable in real time.

arXivHugging FaceGitHubOpenAIAnthropicDeepMindReutersBBC TechHacker NewsReddit MLVerified feedsFunding

ANGESTROM

The Intelligence Layer of Humanity. Everything AI. All in One Place.

Angestrom connects every piece of the AI ecosystem — data, models, research, companies, tools, and people.

info@angestrom.com www.angestrom.comLucknow, Uttar Pradesh, India

Product

AI Search
AI Models
Research Papers
Companies
News & Events
GitHub Explorer
APIs & Tools
Datasets
Benchmarks
Model lifecycle
Funding graph
Contributors
AI Agents

Resources

Weekly digest
Documentation
Tutorials
Guides
News
Help / Start
Community

Company

About
Contact
Privacy Policy
Terms of Service
Acceptable Use

Enterprise

Pricing
Workspace
Contact Sales

Developer

Developer Hub
API docs
GitHub

Learn

Learning Academy
Roadmaps
Glossary
AI for Beginners

Popular Topics

Loading topics…

View All Topics →

© 2026 Angestrom Intelligence Private Limited. All rights reserved.

English

Theme

Search Papers Models Live AI Intelligence

Search⌕⌘K

Enterprise Pricing Sign in

Home
Repositories
kekzl/imp

Read original ↗

repoGitHubTrust 82 · PrimaryPublished yesterdayLive · 21h ago

kekzl/imp

From-scratch C++/CUDA inference engine for the NVIDIA RTX 5090 (sm_120a) — the best single-GPU backend for agentic AI: tool calling, long-context loops, reasoning and concurrent sub-agents on top of the fastest single-stream decode on the 5090 (beats llama.cpp, at-or-ahead of vLLM on NVFP4). 100% written by Claude Code.

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

newsNVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory newsNvidia’s AI Hardware Comes to Windows in RTX Spark PCs newsClaude Meets Blackwell Ultra: Anthropic’s Models Now Run on NVIDIA GB300 in Azure newsNVIDIA BioNeMo Agent Toolkit Brings Accelerated AI to Life Sciences Researchers in Claude Science newsBuild real agentic apps using CUGA: two dozen working examples on a lightweight harness

Related across the graph

newsBuild real agentic apps using CUGA: two dozen working examples on a lightweight harness newsClaude Meets Blackwell Ultra: Anthropic’s Models Now Run on NVIDIA GB300 in Azure newsNVIDIA BioNeMo Agent Toolkit Brings Accelerated AI to Life Sciences Researchers in Claude Science newsNvidia’s AI Hardware Comes to Windows in RTX Spark PCs newsNVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory

Knowledge path·NBuild real agentic apps using CUGA: two dozen working examples on a lightweight harness→NClaude Meets Blackwell Ultra: Anthropic’s Models Now Run on NVIDIA GB300 in Azure→NNVIDIA BioNeMo Agent Toolkit Brings Accelerated AI to Life Sciences Researchers in Claude Science→Rkekzl/imp

Topics

blackwell cpp cuda fp4 gated-deltanet gguf inference-engine llama-cpp llm llm-inference

Explore

Search similar →Knowledge graph →All repos →Full intelligence feed →

Graph trust82Primary

Graph score29