paperarXivTrust 82 · PrimaryPublished 2d agoLive · 22h ago

Reading Order Inference for Complex Document Layouts

Reading order inference remains a critical bottleneck in the digitization of complex historical manuscripts, where pages contain multiple spatially interleaved reading streams, the canonical example being the Glossa Ordinaria layout, in which a central text is surrounded by commentaries that wrap around it in non-rectangular, non-convex regions. We present a training-free, graph-based framework: each OCR text line becomes a node in a directed candidate-transition graph, edges are scored by a weighted additive ensemble of two lightweight language-model signals (causal language model conditional

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

Linked via arxiv authorIddo Hakim →
Reading Order Inference for Complex Document Layouts
Linked via arxiv authorSharva Gogawale →
Reading Order Inference for Complex Document Layouts
Linked via arxiv authorOmer Ventura →
Reading Order Inference for Complex Document Layouts
Linked via arxiv authorGal Grudka →
Reading Order Inference for Complex Document Layouts
Linked via arxiv authorDaria Vasyutinsky-Shapira →
Reading Order Inference for Complex Document Layouts
Linked via arxiv authorBerat Kurar-Barakat →
Reading Order Inference for Complex Document Layouts
Linked via arxiv authorNachum Dershowitz →
Reading Order Inference for Complex Document Layouts

Covers

newsFind the best open-source OCR models in one place at Papers with Code [P]newsDiffusionGemma: 4x faster text generation

authored (incoming)

personIddo Hakim personSharva Gogawale personOmer Ventura personGal Grudka personDaria Vasyutinsky-Shapira personBerat Kurar-Barakat personNachum Dershowitz

Related across the graph

newsDiffusionGemma: 4x faster text generation personDaria Vasyutinsky-Shapira personIddo Hakim personBerat Kurar-Barakat personGal Grudka newsFind the best open-source OCR models in one place at Papers with Code [P]personSharva Gogawale personNachum Dershowitz personOmer Ventura

Topics

cs.AI