paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago
On the Faithfulness of Post-Hoc Concept Bottleneck Models
Human decision-making interprets the world through high-level concepts, such as recognizing a bird by its belly color. To bridge the gap between opaque deep learning representations and human understanding, Post-Hoc Concept Bottleneck Models (post-hoc CBMs) project latent features onto interpretable concept spaces using auxiliary datasets or vision-language models. However, relying on target task accuracy as the primary measure of post-hoc CBM success obscures whether the learned concepts are semantically meaningful or merely predictive artifacts. For example, random concept projections can ac
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
