Explainable deep reinforcement learning: state of the art and challenges

GA Vouros - ACM Computing Surveys, 2022 - dl.acm.org
Interpretability, explainability, and transparency are key issues to introducing artificial
intelligence methods in many critical domains. This is important due to ethical concerns and …

Explainable autonomous robots: a survey and perspective

T Sakai, T Nagai - Advanced Robotics, 2022 - Taylor & Francis
Advanced communication protocols are critical for the coexistence of autonomous robots
and humans. Thus, the development of explanatory capabilities in robots is an urgent first …

Interpretable machine learning: Fundamental principles and 10 grand challenges

C Rudin, C Chen, Z Chen, H Huang… - Statistic …, 2022 - projecteuclid.org
Interpretability in machine learning (ML) is crucial for high stakes decisions and
troubleshooting. In this work, we provide fundamental principles for interpretable ML, and …

Explainable reinforcement learning: A survey

E Puiutta, EMSP Veith - … cross-domain conference for machine learning …, 2020 - Springer
Abstract Explainable Artificial Intelligence (XAI), ie, the development of more transparent and
interpretable AI models, has gained increased traction over the last few years. This is due to …

Explainable reinforcement learning through a causal lens

P Madumal, T Miller, L Sonenberg… - Proceedings of the AAAI …, 2020 - ojs.aaai.org
Prominent theories in cognitive science propose that humans understand and represent the
knowledge of the world through causal relationships. In making sense of the world, we build …

A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer
Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

Discovering symbolic policies with deep reinforcement learning

M Landajuela, BK Petersen, S Kim… - International …, 2021 - proceedings.mlr.press
Deep reinforcement learning (DRL) has proven successful for many difficult control
problems by learning policies represented by neural networks. However, the complexity of …

The emerging landscape of explainable ai planning and decision making

T Chakraborti, S Sreedharan… - arxiv preprint arxiv …, 2020 - arxiv.org
In this paper, we provide a comprehensive outline of the different threads of work in
Explainable AI Planning (XAIP) that has emerged as a focus area in the last couple of years …

Bridging the human-ai knowledge gap: Concept discovery and transfer in alphazero

L Schut, N Tomasev, T McGrath, D Hassabis… - arxiv preprint arxiv …, 2023 - arxiv.org
Artificial Intelligence (AI) systems have made remarkable progress, attaining super-human
performance across various domains. This presents us with an opportunity to further human …

Edge: Explaining deep reinforcement learning policies

W Guo, X Wu, U Khan, X **ng - Advances in Neural …, 2021 - proceedings.neurips.cc
With the rapid development of deep reinforcement learning (DRL) techniques, there is an
increasing need to understand and interpret DRL policies. While recent research has …