Explainable deep reinforcement learning: state of the art and challenges

GA Vouros - ACM Computing Surveys, 2022 - dl.acm.org
Interpretability, explainability, and transparency are key issues to introducing artificial
intelligence methods in many critical domains. This is important due to ethical concerns and …

Explainable reinforcement learning: A survey and comparative review

S Milani, N Topin, M Veloso, F Fang - ACM Computing Surveys, 2024 - dl.acm.org
Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine
learning that has attracted considerable attention in recent years. The goal of XRL is to …

Interpretable machine learning: Fundamental principles and 10 grand challenges

C Rudin, C Chen, Z Chen, H Huang… - Statistic …, 2022 - projecteuclid.org
Interpretability in machine learning (ML) is crucial for high stakes decisions and
troubleshooting. In this work, we provide fundamental principles for interpretable ML, and …

Explainable reinforcement learning: A survey

E Puiutta, EMSP Veith - … cross-domain conference for machine learning …, 2020 - Springer
Abstract Explainable Artificial Intelligence (XAI), ie, the development of more transparent and
interpretable AI models, has gained increased traction over the last few years. This is due to …

Discovering symbolic policies with deep reinforcement learning

M Landajuela, BK Petersen, S Kim… - International …, 2021 - proceedings.mlr.press
Deep reinforcement learning (DRL) has proven successful for many difficult control
problems by learning policies represented by neural networks. However, the complexity of …

A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer
Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

Edge: Explaining deep reinforcement learning policies

W Guo, X Wu, U Khan, X **ng - Advances in Neural …, 2021 - proceedings.neurips.cc
With the rapid development of deep reinforcement learning (DRL) techniques, there is an
increasing need to understand and interpret DRL policies. While recent research has …

A survey of explainable reinforcement learning

S Milani, N Topin, M Veloso, F Fang - arxiv preprint arxiv:2202.08434, 2022 - arxiv.org
Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine
learning that has attracted considerable attention in recent years. The goal of XRL is to …

Human-in-the-loop reinforcement learning: A survey and position on requirements, challenges, and opportunities

CO Retzlaff, S Das, C Wayllace, P Mousavi… - Journal of Artificial …, 2024 - jair.org
Artificial intelligence (AI) and especially reinforcement learning (RL) have the potential to
enable agents to learn and perform tasks autonomously with superhuman performance …

Reinforcement learning interpretation methods: A survey

A Alharin, TN Doan, M Sartipi - IEEE Access, 2020 - ieeexplore.ieee.org
Reinforcement Learning (RL) systems achieved outstanding performance in different
domains such as Atari games, finance, healthcare, and self-driving cars. However, their …