Read original ↗
paperarXivTrust 82 · PrimaryPublished 2d agoLive · 21h ago

GenAU: Language-Grounded Industrial Anomaly Understanding with Vision-Language Models

Industrial inspection requires more than binary anomaly detection: a practical system should determine whether an anomaly exists, localize the defective region, identify the defect type, and provide interpretable visual evidence. Existing CLIP-based methods detect and localize anomalies well but offer limited language-level defect understanding, while instruction-tuned vision-language models can describe defects but do not natively produce pixel-level masks. We introduce GenAU, a Generalist vision-language framework for industrial Anomaly Understanding that unifies image-level detection, pixel

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

  • Linked via arxiv authorHongkuan Zhou

    GenAU: Language-Grounded Industrial Anomaly Understanding with Vision-Language Models

  • Linked via arxiv authorTristan Rehm

    GenAU: Language-Grounded Industrial Anomaly Understanding with Vision-Language Models

  • Linked via arxiv authorNadeem Nazer

    GenAU: Language-Grounded Industrial Anomaly Understanding with Vision-Language Models

  • Linked via arxiv authorLavdim Halilaj

    GenAU: Language-Grounded Industrial Anomaly Understanding with Vision-Language Models

  • Linked via arxiv authorJingcheng Wu

    GenAU: Language-Grounded Industrial Anomaly Understanding with Vision-Language Models

  • Linked via arxiv authorSteffen Staab

    GenAU: Language-Grounded Industrial Anomaly Understanding with Vision-Language Models

Implements

Covers

Has model

authored (incoming)

Related across the graph

Topics