Skip to main content
Angestrom home
SearchPapersModelsLive AIIntelligence
Search⌕⌘K
EnterprisePricingSign in

Stay Ahead in the AI Revolution

Weekly digest — EPI pulse, top intelligence, fresh lineage. Free, no account.

Follow Angestrom
Global source network
Synced every 5 minutes

Continuous sync from primary AI sources — indexed, enriched, and queryable in real time.

arXivHugging FaceGitHubOpenAIAnthropicDeepMindReutersBBC TechHacker NewsReddit MLVerified feedsFunding
ANGESTROM

The Intelligence Layer of Humanity. Everything AI. All in One Place.

Angestrom connects every piece of the AI ecosystem — data, models, research, companies, tools, and people.

info@angestrom.comwww.angestrom.comLucknow, Uttar Pradesh, India

Product

  • AI Search
  • AI Models
  • Research Papers
  • Companies
  • News & Events
  • GitHub Explorer
  • APIs & Tools
  • Datasets
  • Benchmarks
  • Model lifecycle
  • Funding graph
  • Contributors
  • AI Agents

Resources

  • Weekly digest
  • Documentation
  • Tutorials
  • Guides
  • News
  • Help / Start
  • Community

Company

  • About
  • Contact
  • Privacy Policy
  • Terms of Service
  • Acceptable Use

Enterprise

  • Pricing
  • Workspace
  • Contact Sales

Developer

  • Developer Hub
  • API docs
  • GitHub

Learn

  • Learning Academy
  • Roadmaps
  • Glossary
  • AI for Beginners

Popular Topics

Loading topics…
View All Topics →
© 2026 Angestrom Intelligence Private Limited. All rights reserved.
English
Theme
Angestrom home
SearchPapersModelsLive AIIntelligence
Search⌕⌘K
EnterprisePricingSign in
  1. Home
  2. /Repositories
  3. /Zefan-Cai/R-KV
Read original ↗
repoGitHubTrust 82 · PrimaryPublished yesterdayLive · yesterday

Zefan-Cai/R-KV

[Neurips 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

newsNew benchmark exposes reasoning gaps in top modelsnewsDeepSeek-V4-Flash (MXFP4): compute buffer scales ~3x just from KV cache quant type (f16 vs q8_0) — anyone else seeing this? Llama.cpp

Related to

modelRetrace-1.5B

Implements

paperCARVE: Content-Aware Recurrent with Value Efficiency for Chunk-Parallel Linear AttentionpaperMessage Passing Enables Efficient Reasoning

Implements (incoming)

paperCheckRLM: Effective Knowledge-Thought Coherence Checking in Retrieval-Augmented Reasoning

Related across the graph

paperCheckRLM: Effective Knowledge-Thought Coherence Checking in Retrieval-Augmented ReasoningnewsDeepSeek-V4-Flash (MXFP4): compute buffer scales ~3x just from KV cache quant type (f16 vs q8_0) — anyone else seeing this? Llama.cpppaperMessage Passing Enables Efficient ReasoningpaperCARVE: Content-Aware Recurrent with Value Efficiency for Chunk-Parallel Linear AttentionmodelRetrace-1.5BnewsNew benchmark exposes reasoning gaps in top models
Knowledge path·PCheckRLM: Effective Knowledge-Thought Coherence Checking in Retrieval-Augmented Reasoning→NDeepSeek-V4-Flash (MXFP4): compute buffer scales ~3x just from KV cache quant type (f16 vs q8_0) — anyone else seeing this? Llama.cpp→PMessage Passing Enables Efficient Reasoning→RZefan-Cai/R-KV

Topics

kvcachellmreasoning-models

Explore

Search similar →Knowledge graph →All repos →Full intelligence feed →
Graph trust82Primary
Graph score1204