paperarXivTrust 82 · PrimaryPublished 2d agoLive · 21h ago

LongVQUBench: Benchmarking Long-Term Video Quality Understanding of Vision-Language Models

The evaluation of long-term video quality understanding remains an open challenge for large vision-language models (LVLMs). Existing video quality benchmarks predominantly focus on short clips and isolated distortions, overlooking the temporal continuity, cumulative degradation, and reasoning complexity inherent in long-duration content. To address these limitations, we present LongVQUBench, a comprehensive benchmark for long-term video quality understanding. LongVQUBench contains over 1200 diverse videos spanning movies, documentaries, surveillance footage, egocentric recordings, and animated

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

Linked via arxiv authorArpita Nema →
LongVQUBench: Benchmarking Long-Term Video Quality Understanding of Vision-Language Models
Linked via arxiv authorHanwei Zhu →
LongVQUBench: Benchmarking Long-Term Video Quality Understanding of Vision-Language Models
Linked via arxiv authorXi Zhang →
LongVQUBench: Benchmarking Long-Term Video Quality Understanding of Vision-Language Models
Linked via arxiv authorWeisi Lin →
LongVQUBench: Benchmarking Long-Term Video Quality Understanding of Vision-Language Models

Has model

modelVioletVision-3B

Implements

repovlm-starter repomrreviewai/ai-tools-for-content-creators

authored (incoming)

personArpita Nema personHanwei Zhu personXi Zhang personWeisi Lin

Covers (incoming)

newsVideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization - Apple Machine Learning Research

Related across the graph

personXi Zhang personArpita Nema personHanwei Zhu repomrreviewai/ai-tools-for-content-creators modelVioletVision-3B newsVideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization - Apple Machine Learning Research personWeisi Lin repovlm-starter

Topics

cs.AI