paper · arXiv · Trust 82 · Primary · Published 7d ago · Live · 4d ago

Position Bias Correction is Insufficient for One-Pass Attention Sorting

Long-context language models suffer from position bias, where information in middle positions is underutilized. Attention Sorting addresses this by iteratively reordering documents based on attention patterns, but its multiple sort-and-generate cycles increase deployment cost. We hypothesize that position bias is the primary bottleneck and propose Debiased One-Pass Attention Sorting, which estimates a per-prompt position-bias curve from the low-attention majority of documents and uses it to correct raw attention scores (via subtraction or division) to enable single-pass sorting. Our experiment
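The correction step described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration, not the paper's implementation: the function names, the `keep_fraction` parameter, the choice of piecewise-linear interpolation to build the bias curve, and the convention of placing the highest-scoring document last (closest to the query). The input is assumed to be one aggregated attention score per document, listed in prompt order.

```python
from typing import List, Sequence


def estimate_bias_curve(scores: Sequence[float], keep_fraction: float = 0.75) -> List[float]:
    """Estimate a per-prompt position-bias curve (hypothetical helper).

    Assumption: the lowest-scoring `keep_fraction` of documents are mostly
    irrelevant, so their attention reflects position bias alone. The curve is
    a piecewise-linear interpolation through those documents, held flat
    beyond the first and last of them.
    """
    n = len(scores)
    if n == 0:
        return []
    k = max(1, int(n * keep_fraction))
    # Positions of the k lowest-attention documents, restored to prompt order.
    low = sorted(sorted(range(n), key=lambda i: scores[i])[:k])
    curve = []
    for pos in range(n):
        left = max((i for i in low if i <= pos), default=None)
        right = min((i for i in low if i >= pos), default=None)
        if left is None:
            curve.append(scores[right])
        elif right is None:
            curve.append(scores[left])
        elif left == right:
            curve.append(scores[pos])
        else:
            t = (pos - left) / (right - left)
            curve.append(scores[left] + t * (scores[right] - scores[left]))
    return curve


def debias(scores: Sequence[float], mode: str = "subtract", eps: float = 1e-8) -> List[float]:
    """Correct raw attention scores by subtracting or dividing out the bias curve."""
    curve = estimate_bias_curve(scores)
    if mode == "subtract":
        return [s - b for s, b in zip(scores, curve)]
    if mode == "divide":
        return [s / (b + eps) for s, b in zip(scores, curve)]
    raise ValueError(f"unknown mode: {mode!r}")


def one_pass_order(scores: Sequence[float], mode: str = "subtract") -> List[int]:
    """Return document indices sorted once by corrected attention.

    Assumption: ascending order, so the highest corrected score ends up last.
    """
    corrected = debias(scores, mode)
    return sorted(range(len(scores)), key=lambda i: corrected[i])


# Toy prompt: attention is inflated at both ends, and the document at
# position 5 stands out against its low-attention neighbours.
raw = [0.30, 0.12, 0.06, 0.04, 0.03, 0.26, 0.03, 0.03, 0.04, 0.06, 0.12, 0.28]
order = one_pass_order(raw, mode="subtract")
```

In this toy input, sorting by raw attention would favor the first document, whereas the corrected scores favor the middle document at position 5. A smoother parametric fit could replace the interpolation; the sketch only shows where the estimate-then-correct step sits before the single sort.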

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Implements

repo: attention-zoo

Covers

news: Breakthrough in long-context efficiency announced

Related to

glossary_term: Attention

Related across the graph

news: Breakthrough in long-context efficiency announced · glossary_term: Attention · repo: attention-zoo

Topics

cs.CL