A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

How to certify machine learning based safety-critical systems? A systematic literature review

F Tambon, G Laberge, L An, A Nikanjam… - Automated Software …, 2022 - Springer
Abstract Context Machine Learning (ML) has been at the heart of many innovations over the
past years. However, including it in so-called “safety-critical” systems such as automotive or …

NNV: the neural network verification tool for deep neural networks and learning-enabled cyber-physical systems

HD Tran, X Yang, D Manzanas Lopez, P Musau… - … conference on computer …, 2020 - Springer
This paper presents the Neural Network Verification (NNV) software tool, a set-based
verification framework for deep neural networks (DNNs) and learning-enabled cyber …

Safe-state enhancement method for autonomous driving via direct hierarchical reinforcement learning

Z Gu, L Gao, H Ma, SE Li, S Zheng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Reinforcement learning (RL) has shown excellent performance in the sequential decision-
making problem, where safety in the form of state constraints is of great significance in the …

Reinforcement learning for temporal logic control synthesis with probabilistic satisfaction guarantees

M Hasanbeig, Y Kantaros, A Abate… - 2019 IEEE 58th …, 2019 - ieeexplore.ieee.org
We present a model-free reinforcement learning algorithm to synthesize control policies that
maximize the probability of satisfying high-level control objectives given as Linear Temporal …

Neurosymbolic reinforcement learning with formally verified exploration

G Anderson, A Verma, I Dillig… - Advances in neural …, 2020 - proceedings.neurips.cc
We present REVEL, a partially neural reinforcement learning (RL) framework for provably
safe exploration in continuous state and action spaces. A key challenge for provably safe …

Cautious reinforcement learning with logical constraints

M Hasanbeig, A Abate, D Kroening - arxiv preprint arxiv:2002.12156, 2020 - arxiv.org
This paper presents the concept of an adaptive safe padding that forces Reinforcement
Learning (RL) to synthesise optimal control policies while ensuring safety during the …

Safe reinforcement learning via probabilistic shields

N Jansen, B Könighofer, S Junges, AC Serban… - arxiv preprint arxiv …, 2018 - arxiv.org
This paper targets the efficient construction of a safety shield for decision making in
scenarios that incorporate uncertainty. Markov decision processes (MDPs) are prominent …

Safe reinforcement learning for autonomous lane changing using set-based prediction

H Krasowski, X Wang, M Althoff - 2020 IEEE 23rd international …, 2020 - ieeexplore.ieee.org
Machine learning approaches often lack safety guarantees, which are often a key
requirement in real-world tasks. This paper addresses the lack of safety guarantees by …

Enforcing policy feasibility constraints through differentiable projection for energy optimization

B Chen, PL Donti, K Baker, JZ Kolter… - Proceedings of the Twelfth …, 2021 - dl.acm.org
While reinforcement learning (RL) is gaining popularity in energy systems control, its real-
world applications are limited due to the fact that the actions from learned policies may not …