[HTML][HTML] XAI systems evaluation: a review of human and computer-centred methods

P Lopes, E Silva, C Braga, T Oliveira, L Rosado - Applied Sciences, 2022 - mdpi.com
The lack of transparency of powerful Machine Learning systems paired with their growth in
popularity over the last decade led to the emergence of the eXplainable Artificial Intelligence …

Interpretable machine learning for weather and climate prediction: A review

R Yang, J Hu, Z Li, J Mu, T Yu, J **a, X Li… - Atmospheric …, 2024 - Elsevier
Advanced machine learning models have recently achieved high predictive accuracy for
weather and climate prediction. However, these complex models often lack inherent …

Towards interpreting and mitigating shortcut learning behavior of NLU models

M Du, V Manjunatha, R Jain, R Deshpande… - arxiv preprint arxiv …, 2021 - arxiv.org
Recent studies indicate that NLU models are prone to rely on shortcut features for prediction,
without achieving true language understanding. As a result, these models fail to generalize …

Unifying fourteen post-hoc attribution methods with taylor interactions

H Deng, N Zou, M Du, W Chen, G Feng… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Various attribution methods have been developed to explain deep neural networks (DNNs)
by inferring the attribution/importance/contribution score of each input variable to the final …

Explainable artificial intelligence for feature selection in network traffic classification: A comparative study

P Khani, E Moeinaddini, ND Abnavi… - Transactions on …, 2024 - Wiley Online Library
Over the past decade, there has been a growing surge of interest in leveraging artificial
intelligence and machine learning models to address real‐world challenges within the field …

Understanding and unifying fourteen attribution methods with taylor interactions

H Deng, N Zou, M Du, W Chen, G Feng, Z Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
Various attribution methods have been developed to explain deep neural networks (DNNs)
by inferring the attribution/importance/contribution score of each input variable to the final …

Navigating the shortcut maze: A comprehensive analysis of shortcut learning in text classification by language models

Y Zhou, R Tang, Z Yao, Z Zhu - arxiv preprint arxiv:2409.17455, 2024 - arxiv.org
Language models (LMs), despite their advances, often depend on spurious correlations,
undermining their accuracy and generalizability. This study addresses the overlooked …

Interpretability of neural networks based on game-theoretic interactions

H Zhou, J Ren, H Deng, X Cheng, J Zhang… - Machine Intelligence …, 2024 - Springer
This paper introduces the system of game-theoretic interactions, which connects both the
explanation of knowledge encoded in a deep neural networks (DNN) and the explanation of …

[HTML][HTML] PredDiff: Explanations and interactions from conditional expectations

S Blücher, J Vielhaben, N Strodthoff - Artificial Intelligence, 2022 - Elsevier
PredDiff is a model-agnostic, local attribution method that is firmly rooted in probability
theory. Its simple intuition is to measure prediction changes while marginalizing features. In …

Generative perturbation analysis for probabilistic black-box anomaly attribution

T Idé, N Abe - Proceedings of the 29th ACM SIGKDD Conference on …, 2023 - dl.acm.org
We address the task of probabilistic anomaly attribution in the black-box regression setting,
where the goal is to compute the probability distribution of the attribution score of each input …