paper · arXiv · Trust 82 · Primary · Published 3d ago · Live · 2d ago

No Place to Hide: Benchmarking Video Hallucination with Background-Controlled Pairs

We introduce VidPair-Halluc, a new benchmark for evaluating video hallucination in large video models (LVMs) under rigorous and controlled conditions. Unlike previous benchmarks that primarily rely on text-based perturbations or adversarial questions while neglecting the consistency of visual backgrounds, VidPair-Halluc features video pairs with highly similar backgrounds but distinctly different foreground semantics, enabling precise attribution of model errors to genuine hallucination rather than background variation. The benchmark is constructed through PairFlow, a pipeline that leverages r

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

news · Into the Omniverse: Three Workflows for Improving Vision AI Agent Accuracy With Synthetic Data and Fine-Tuning

Implements (incoming)

repo · voxel51/fiftyone

Related across the graph

news · Into the Omniverse: Three Workflows for Improving Vision AI Agent Accuracy With Synthetic Data and Fine-Tuning
repo · voxel51/fiftyone

Topics

cs.CV