paperarXivTrust 82 · PrimaryPublished 5d agoLive · 3d ago
Covering the Unseen: Information Demand Coverage Optimization for Retrieval-Augmented Generation
Retrieval-augmented generation (RAG) typically treats context selection as ranking chunks against a single query embedding. This assumption breaks down for complex queries, such as multi-hop or ambiguous questions, where top-k selection tends to over-cover one semantic aspect while ignoring critical sub-questions. We propose GeoRAG, which recasts context selection as Information Demand Coverage Optimization. GeoRAG builds a multi-dimensional demand distribution through diverse sub-query generation and reverse-validation weighting, then selects context by minimizing the Sinkhorn-Wasserstein dis
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
