" Help Me Help the AI": Understanding How Explainability Can Support Human-AI Interaction

SSY Kim, EA Watkins, O Russakovsky, R Fong… - Proceedings of the …, 2023 - dl.acm.org
Despite the proliferation of explainable AI (XAI) methods, little is understood about end-
users' explainability needs and behaviors around XAI explanations. To address this gap and …

Multi-dimensional concept discovery (MCD): A unifying framework with completeness guarantees

J Vielhaben, S Bluecher, N Strodthoff - arxiv preprint arxiv:2301.11911, 2023 - arxiv.org
The completeness axiom renders the explanation of a post-hoc XAI method only locally
faithful to the model, ie for a single decision. For the trustworthy application of XAI, in …

Backpack language models

J Hewitt, J Thickstun, CD Manning, P Liang - arxiv preprint arxiv …, 2023 - arxiv.org
We present Backpacks: a new neural architecture that marries strong modeling performance
with an interface for interpretability and control. Backpacks learn multiple non-contextual …

N $\textA^\text2 $ Q: Neural Attention Additive Model for Interpretable Multi-Agent Q-Learning

Z Liu, Y Zhu, C Chen - International Conference on Machine …, 2023 - proceedings.mlr.press
Value decomposition is widely used in cooperative multi-agent reinforcement learning,
however, its implicit credit assignment mechanism is not yet fully understood due to black …

Curve your enthusiasm: concurvity regularization in differentiable generalized additive models

J Siems, K Ditschuneit, W Ripken… - Advances in …, 2023 - proceedings.neurips.cc
Abstract Generalized Additive Models (GAMs) have recently experienced a resurgence in
popularity due to their interpretability, which arises from expressing the target value as a …

GRAND-SLAMIN'Interpretable Additive Modeling with Structural Constraints

S Ibrahim, G Afriat, K Behdin… - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract Generalized Additive Models (GAMs) are a family of flexible and interpretable
models with old roots in statistics. GAMs are often used with pairwise interactions to improve …

Pseudo-class part prototype networks for interpretable breast cancer classification

MA Choukali, MC Amirani, M Valizadeh, A Abbasi… - Scientific Reports, 2024 - nature.com
Interpretability in machine learning has become increasingly important as machine learning
is being used in more and more applications, including those with high-stakes …

Cat: Interpretable concept-based taylor additive models

V Duong, Q Wu, Z Zhou, H Zhao, C Luo… - Proceedings of the 30th …, 2024 - dl.acm.org
As an emerging interpretable technique, Generalized Additive Models (GAMs) adopt neural
networks to individually learn non-linear functions for each feature, which are then combined …

Neural additive models for location scale and shape: A framework for interpretable neural regression beyond the mean

AF Thielmann, RM Kruse, T Kneib… - International …, 2024 - proceedings.mlr.press
Deep neural networks (DNNs) have proven to be highly effective in a variety of tasks,
making them the go-to method for problems requiring high-level predictive power. Despite …

A Comprehensive Survey on Self-Interpretable Neural Networks

Y Ji, Y Sun, Y Zhang, Z Wang, Y Zhuang… - arxiv preprint arxiv …, 2025 - arxiv.org
Neural networks have achieved remarkable success across various fields. However, the
lack of interpretability limits their practical use, particularly in critical decision-making …