paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago

Know Before You Fetch: Calibrated Retrieval-Budget Allocation for Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) typically retrieves a fixed number of passages for every query. This is wasteful when the reader already knows the answer, and it can be harmful when irrelevant or partially relevant passages distract the reader. We formulate adaptive RAG as calibrated retrieval-budget allocation: given a query, decide whether to answer closed-book, retrieve a compact context (k=1), retrieve a full context (k=5), or abstain. The contribution is a probability interface rather than a new raw uncertainty signal. We calibrate sequence log-probability and prefix-logit uncertaint

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

newsRAGless: Q-Q retrieval with score aggregation for closed-domain FAQ [P]

Related across the graph

newsRAGless: Q-Q retrieval with score aggregation for closed-domain FAQ [P]

Topics

cs.CL