Source: https://pubmed.ncbi.nlm.nih.gov/35603010
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead

Cynthia Rudin

Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.

Abstract

Black box machine learning models are currently being used for high stakes decision-making throughout society, causing problems in healthcare, criminal justice, and other domains. People have hoped that creating methods for explaining these black box models will alleviate some of these problems, but trying to explain black box models, rather than creating models that are interpretable in the first place, is likely to perpetuate bad practices and can potentially cause catastrophic harm to society. There is a way forward: design models that are inherently interpretable. This manuscript clarifies the chasm between explaining black boxes and using inherently interpretable models, outlines several key reasons why explainable black boxes should be avoided in high-stakes decisions, identifies challenges to interpretable machine learning, and provides several example applications where interpretable models could potentially replace black box models in criminal justice, healthcare, and computer vision.

Figures

Figure 1:
A fictional depiction of the “accuracy-interpretability trade-off,” taken from the DARPA XAI (Explainable Artificial Intelligence) Broad Agency Announcement [18].
Figure 2:
Saliency does not explain anything except where the network is looking. We have no idea why this image is labeled as either a dog or a musical instrument when considering only saliency. The explanations look essentially the same for both classes. Figure credit: Chaofan Chen and [28].
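
To make this limitation concrete, below is a minimal sketch of gradient-based ("vanilla") saliency for a generic PyTorch image classifier; `model`, `image`, and the class indices are illustrative placeholders, not code from the paper. The map only records how strongly each pixel influences one class score, which is why the maps for "dog" and "musical instrument" can look essentially the same.

# Minimal sketch of gradient-based saliency, assuming a generic PyTorch
# image classifier; `model` and `image` are hypothetical names.
import torch

def saliency_map(model, image, class_idx):
    """Return |d score[class_idx] / d pixel|, reduced over color channels."""
    model.eval()
    x = image.clone().detach().unsqueeze(0).requires_grad_(True)  # (1, C, H, W)
    score = model(x)[0, class_idx]                                # logit of one class
    score.backward()
    return x.grad.detach().abs().max(dim=1).values.squeeze(0)     # (H, W) heatmap

# Maps computed for two different classes often look nearly identical,
# which is the point of Figure 2: saliency shows where, not why.
# dog_map = saliency_map(model, image, class_idx=dog_idx)
# instrument_map = saliency_map(model, image, class_idx=instrument_idx)
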
Figure 3:
This is a machine learning model from the Certifiably Optimal Rule Lists (CORELS) algorithm [32]. This model is the minimizer of a special case of Equation 1 discussed later in the challenges section. CORELS’ code is open source and publicly available at http://corels.eecs.harvard.edu/, along with the data from Florida needed to produce this model.
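
As an illustration of the kind of model CORELS searches for, the sketch below expresses a rule list as executable logic: an ordered sequence of if-then rules ending in a default prediction. The features, thresholds, and labels are hypothetical placeholders, not the actual model shown in Figure 3.

# Toy rule-list predictor in the spirit of CORELS output; the conditions
# and point of prediction (re-arrest) are illustrative assumptions.
def rule_list_predict(person):
    # Rules are checked in order; the first match determines the prediction.
    if person["age"] <= 20 and person["priors"] >= 2:
        return 1              # predict re-arrest
    elif person["priors"] >= 4:
        return 1
    elif person["age"] >= 40:
        return 0              # predict no re-arrest
    else:
        return 0              # default rule

print(rule_list_predict({"age": 19, "priors": 3}))   # -> 1
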
Figure 4:
Scoring system for risk of recidivism from [21], which grew out of [30, 44, 45]. This model was not created by a human; the selection of numbers and features comes from the RiskSLIM machine learning algorithm.
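
For readers unfamiliar with scoring systems, the sketch below shows how such a model is used at prediction time: small integer points are added for the conditions that apply, and the total score is mapped to a risk estimate. The point values, conditions, and logistic mapping are assumed for illustration and are not the published RiskSLIM model.

# Sketch of prediction with a scoring system; all numbers are made up.
import math

POINTS = {                            # hypothetical integer point values
    "prior_arrests>=2": 2,
    "age<=22": 2,
    "prior_failure_to_appear": 1,
}

def risk_of_recidivism(features):
    # Add the points for every condition that is true for this person.
    score = sum(POINTS.get(name, 0) for name, present in features.items() if present)
    # Map the integer score to a probability; the intercept of -3 is assumed.
    return 1.0 / (1.0 + math.exp(-(score - 3)))

print(round(risk_of_recidivism(
    {"prior_arrests>=2": True, "age<=22": True, "prior_failure_to_appear": False}), 2))
# score = 4, risk ≈ 0.73 under these made-up numbers
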
Figure 5:
Image from the authors of [49], indicating that parts of the test image on the left are similar to prototypical parts of training examples. The test image to be classified is on the left, the most similar prototypes are in the middle column, and the heatmaps that show which part of the test image is similar to the prototype are on the right. We included copies of the test image on the right so that it is easier to see what part of the bird the heatmaps are referring to. The similarities of the prototypes to the test image are what determine the predicted class label of the image. Here, the image is predicted to be a clay-colored sparrow. The top prototype seems to be comparing the bird’s head to a prototypical head of a clay-colored sparrow, the second prototype considers the throat of the bird, the third looks at feathers, and the last seems to consider the abdomen and leg. Test image from [50]. Prototypes from [51, 52, 53, 54]. Image constructed by Alina Barnett.
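
The sketch below (NumPy only) gives the general flavor of the prototype comparison behind Figure 5: each learned prototype is compared against every spatial position of the test image's convolutional feature map, the resulting similarity map is what the heatmaps visualize, and its maximum contributes evidence toward that prototype's class. The shapes and the distance-to-similarity transform here are assumptions, not the exact formulation in [49].

# Schematic prototype-to-patch comparison; shapes and the similarity
# function are illustrative assumptions.
import numpy as np

def prototype_similarity_map(feature_map, prototype, eps=1e-4):
    """feature_map: (H, W, D) conv features; prototype: (D,) learned part vector."""
    dists = np.linalg.norm(feature_map - prototype, axis=-1)   # (H, W) L2 distances
    return np.log((dists ** 2 + 1.0) / (dists ** 2 + eps))     # large where similar

rng = np.random.default_rng(0)
fmap = rng.normal(size=(7, 7, 64))   # stand-in for a 7x7x64 feature map
proto = fmap[3, 4]                   # pretend this patch's features are a learned prototype
sim = prototype_similarity_map(fmap, proto)
print(sim.argmax() == 3 * 7 + 4)     # the prototype "fires" at its own patch -> True
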

References

    1. Wexler R. When a Computer Program Keeps You in Jail: How Computers are Harming Criminal Justice. New York Times. 2017 Jun 13.
    2. McGough M. How bad is Sacramento’s air, exactly? Google results appear at odds with reality, some say. Sacramento Bee. 2018 Aug 7.
    3. Varshney KR, Alemzadeh H. On the safety of machine learning: Cyber-physical systems, decision sciences, and data products. Big Data. 2016 Oct;5.
    4. Freitas AA. Comprehensible classification models: a position paper. ACM SIGKDD Explorations Newsletter. 2014 Mar;15(1):1-10.
    5. Kodratoff Y. The comprehensibility manifesto. KDD Nugget Newsletter. 1994;94(9).