paperarXivTrust 82 · PrimaryPublished 7d agoLive · 4d ago

TextDS: Parameter-Efficient Representation Alignment for Scene Text Detection under Distribution Shifts

In real-world deployments, scene text detectors inevitably face distribution shifts beyond the training distribution. Prior work often depends on large-scale scene-text pretraining, yet evaluation under cross-domain changes and real-world imaging degradations remains limited. We propose TextDS, an efficient framework for scene text detection under distribution shifts. First, we propose a data-efficient dual-encoder design with visual foundation models, eliminating the reliance on large-scale scene-text pretraining. Second, we introduce Step-wise LoRA adaptation (SWLoRA), which performs progres

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Topics

cs.CV