A survey on the explainability of supervised machine learning

N Burkart, MF Huber - Journal of Artificial Intelligence Research, 2021 - jair.org
Predictions obtained by, eg, artificial neural networks have a high accuracy but humans
often perceive the models as black boxes. Insights about the decision making are mostly …

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Interpretable machine learning: Fundamental principles and 10 grand challenges

C Rudin, C Chen, Z Chen, H Huang… - Statistic …, 2022 - projecteuclid.org
Interpretability in machine learning (ML) is crucial for high stakes decisions and
troubleshooting. In this work, we provide fundamental principles for interpretable ML, and …

Openxai: Towards a transparent evaluation of model explanations

C Agarwal, S Krishna, E Saxena… - Advances in neural …, 2022 - proceedings.neurips.cc
While several types of post hoc explanation methods have been proposed in recent
literature, there is very little work on systematically benchmarking these methods. Here, we …

Interpretable machine learning–a brief history, state-of-the-art and challenges

C Molnar, G Casalicchio, B Bischl - Joint European conference on …, 2020 - Springer
We present a brief history of the field of interpretable machine learning (IML), give an
overview of state-of-the-art interpretation methods and discuss challenges. Research in IML …

Are explanations helpful? a comparative study of the effects of explanations in ai-assisted decision-making

X Wang, M Yin - Proceedings of the 26th International Conference on …, 2021 - dl.acm.org
This paper contributes to the growing literature in empirical evaluation of explainable AI
(XAI) methods by presenting a comparison on the effects of a set of established XAI methods …

[HTML][HTML] Evaluating XAI: A comparison of rule-based and example-based explanations

J van der Waa, E Nieuwburg, A Cremers, M Neerincx - Artificial intelligence, 2021 - Elsevier
Abstract Current developments in Artificial Intelligence (AI) led to a resurgence of
Explainable AI (XAI). New methods are being researched to obtain information from AI …

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

C Rudin - Nature machine intelligence, 2019 - nature.com
Black box machine learning models are currently being used for high-stakes decision
making throughout society, causing problems in healthcare, criminal justice and other …

Peeking inside the black-box: a survey on explainable artificial intelligence (XAI)

A Adadi, M Berrada - IEEE access, 2018 - ieeexplore.ieee.org
At the dawn of the fourth industrial revolution, we are witnessing a fast and widespread
adoption of artificial intelligence (AI) in our daily life, which contributes to accelerating the …

Explaining explanations in AI

B Mittelstadt, C Russell, S Wachter - Proceedings of the conference on …, 2019 - dl.acm.org
Recent work on interpretability in machine learning and AI has focused on the building of
simplified models that approximate the true criteria used to make decisions. These models …