paper · arXiv · Trust 82 · Primary · Published 4d ago · Live · 3d ago
Efficient RGB-T Object Detection via Sparse Cross-Modality Fusion
RGB-T detectors leverage the complementary strengths of visible and thermal infrared modalities, achieving robust performance under challenging conditions. Many of them resort to heavy dual backbones and exhaustive cross-modality fusion across the entire image, leading to impractically high computational costs. We observe that most image regions are smooth backgrounds (e.g., sky, ground) that can be easily handled by lightweight single-modality models. In light of this observation, we propose a sparse fusion mechanism for efficient RGB-T detection: first rapidly scanning the image to identify
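
The sparse-fusion idea in the abstract can be illustrated with a toy sketch. It is not the paper's method: the abstract is cut off before it describes the scanning step, so everything concrete here is an assumption made for illustration. The hypothetical `select_patches` uses per-patch intensity variance as a stand-in for the "rapid scan", and the hypothetical `sparse_fuse` uses a plain average as a stand-in for cross-modality fusion. The point is only that smooth patches keep the single-modality value while fusion runs on the few patches flagged as non-smooth.

```python
from statistics import pvariance


def select_patches(image, patch, threshold):
    """Return (row, col) origins of patches whose variance exceeds threshold.

    Assumption: variance is only a cheap proxy for "not a smooth background".
    """
    h, w = len(image), len(image[0])
    selected = []
    for r in range(0, h, patch):
        for c in range(0, w, patch):
            values = [image[i][j]
                      for i in range(r, min(r + patch, h))
                      for j in range(c, min(c + patch, w))]
            if pvariance(values) > threshold:
                selected.append((r, c))
    return selected


def sparse_fuse(rgb_gray, thermal, patch=2, threshold=0.01):
    """Fuse the two modalities only inside selected patches.

    Assumption: averaging stands in for a learned fusion module.
    Unselected (smooth) patches keep the single-modality RGB value.
    """
    h, w = len(rgb_gray), len(rgb_gray[0])
    fused = [row[:] for row in rgb_gray]  # start from the single-modality map
    selected = select_patches(rgb_gray, patch, threshold)
    for r, c in selected:
        for i in range(r, min(r + patch, h)):
            for j in range(c, min(c + patch, w)):
                fused[i][j] = 0.5 * (rgb_gray[i][j] + thermal[i][j])
    return fused, selected


# Toy 4x4 inputs: flat background everywhere except the bottom-right patch.
rgb_gray = [
    [0.5, 0.5, 0.5, 0.5],
    [0.5, 0.5, 0.5, 0.5],
    [0.5, 0.5, 0.0, 1.0],
    [0.5, 0.5, 1.0, 0.0],
]
thermal = [[1.0] * 4 for _ in range(4)]

fused, selected = sparse_fuse(rgb_gray, thermal)
print(selected)  # only the textured patch is fused
```

In this toy input only one of the four patches is fused, so the cost of the fusion step scales with the number of selected patches rather than with the full image area.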
