Reward machines: Exploiting reward function structure in reinforcement learning

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org
Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

On the expressivity of markov reward

D Abel, W Dabney, A Harutyunyan… - Advances in …, 2021 - proceedings.neurips.cc
Reward is the driving force for reinforcement-learning agents. This paper is dedicated to
understanding the expressivity of reward as a way to capture tasks that we would want an …

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

Automated verification and synthesis of stochastic hybrid systems: A survey

A Lavaei, S Soudjani, A Abate, M Zamani - Automatica, 2022 - Elsevier
Stochastic hybrid systems have received significant attentions as a relevant modeling
framework describing many systems, from engineering to the life sciences: they enable the …

(Ir) rationality in AI: State of the Art, Research Challenges and Open Questions

O Macmillan-Scott, M Musolesi - arxiv preprint arxiv:2311.17165, 2023 - arxiv.org
The concept of rationality is central to the field of artificial intelligence. Whether we are
seeking to simulate human reasoning, or the goal is to achieve bounded optimality, we …

Reinforcement learning with knowledge representation and reasoning: A brief survey

C Yu, X Zheng, HH Zhuo, H Wan, W Luo - arxiv preprint arxiv:2304.12090, 2023 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous development in recent years, but
still faces significant obstacles in addressing complex real-life problems due to the issues of …

Instructing goal-conditioned reinforcement learning agents with temporal logic objectives

W Qiu, W Mao, H Zhu - Advances in Neural Information …, 2023 - proceedings.neurips.cc
Goal-conditioned reinforcement learning (RL) is a powerful approach for learning general-
purpose skills by reaching diverse goals. However, it has limitations when it comes to task …

On the limitations of markovian rewards to express multi-objective, risk-sensitive, and modal tasks

J Skalse, A Abate - Uncertainty in Artificial Intelligence, 2023 - proceedings.mlr.press
In this paper, we study the expressivity of scalar, Markovian reward functions in
Reinforcement Learning (RL), and identify several limitations to what they can express …

Certified reinforcement learning with logic guidance

H Hasanbeig, D Kroening, A Abate - Artificial Intelligence, 2023 - Elsevier
Reinforcement Learning (RL) is a widely employed machine learning architecture that has
been applied to a variety of control problems. However, applications in safety-critical …

Preferential cyber defense for power grids

M Moradi, Y Weng, J Dirkman, YC Lai - PRX Energy, 2023 - APS
The integration of computing and communication capabilities into the power grid has led to
vulnerabilities enabling attackers to launch cyberattacks on the grid. The resources that can …