paperarXivTrust 82 · PrimaryPublished 7d agoLive · 4d ago
Position Bias Correction is Insufficient for One-Pass Attention Sorting
Long-context language models suffer from position bias, where information in middle positions is underutilized. Attention Sorting addresses this by iteratively reordering documents based on attention patterns, but its multiple sort-and-generate cycles increase deployment cost. We hypothesize that position bias is the primary bottleneck and propose Debiased One-Pass Attention Sorting, which estimates a per-prompt position-bias curve from the low-attention majority of documents and uses it to correct raw attention scores (via subtraction or division) to enable single-pass sorting. Our experiment
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
