Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort

Kim, Jeeyung; Wang, Ze; Qiu, Qiang

Computer Science > Machine Learning

arXiv:2407.08947 (cs)

[Submitted on 12 Jul 2024]

Title:Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort

Authors:Jeeyung Kim, Ze Wang, Qiang Qiu

View PDF HTML (experimental)

Abstract:Enhancing model interpretability can address spurious correlations by revealing how models draw their predictions. Concept Bottleneck Models (CBMs) can provide a principled way of disclosing and guiding model behaviors through human-understandable concepts, albeit at a high cost of human efforts in data annotation. In this paper, we leverage a synergy of multiple foundation models to construct CBMs with nearly no human effort. We discover undesirable biases in CBMs built on pre-trained models and propose a novel framework designed to exploit pre-trained models while being immune to these biases, thereby reducing vulnerability to spurious correlations. Specifically, our method offers a seamless pipeline that adopts foundation models for assessing potential spurious correlations in datasets, annotating concepts for images, and refining the annotations for improved robustness. We evaluate the proposed method on multiple datasets, and the results demonstrate its effectiveness in reducing model reliance on spurious correlations while preserving its interpretability.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.08947 [cs.LG]
	(or arXiv:2407.08947v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.08947

Submission history

From: Jeeyung Kim [view email]
[v1] Fri, 12 Jul 2024 03:07:28 UTC (8,024 KB)

Computer Science > Machine Learning

Title:Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators