Read original ↗
paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago

On the Faithfulness of Post-Hoc Concept Bottleneck Models

Human decision-making interprets the world through high-level concepts, such as recognizing a bird by its belly color. To bridge the gap between opaque deep learning representations and human understanding, Post-Hoc Concept Bottleneck Models (post-hoc CBMs) project latent features onto interpretable concept spaces using auxiliary datasets or vision-language models. However, relying on target task accuracy as the primary measure of post-hoc CBM success obscures whether the learned concepts are semantically meaningful or merely predictive artifacts. For example, random concept projections can ac

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

Related across the graph

Topics