A survey of constraint formulations in safe reinforcement learning

A Wachi, X Shen, Y Sui - arxiv preprint arxiv:2402.02025, 2024 - arxiv.org
Safety is critical when applying reinforcement learning (RL) to real-world problems. As a
result, safe RL has emerged as a fundamental and powerful paradigm for optimizing an …

Safe reinforcement learning for power system control: A review

P Yu, Z Wang, H Zhang, Y Song - arxiv preprint arxiv:2407.00681, 2024 - arxiv.org
The large-scale integration of intermittent renewable energy resources introduces increased
uncertainty and volatility to the supply side of power systems, thereby complicating system …

Safe offline reinforcement learning with feasibility-guided diffusion model

Y Zheng, J Li, D Yu, Y Yang, SE Li, X Zhan… - arxiv preprint arxiv …, 2024 - arxiv.org
Safe offline RL is a promising way to bypass risky online interactions towards safe policy
learning. Most existing methods only enforce soft constraints, ie, constraining safety …

Empowering autonomous driving with large language models: A safety perspective

Y Wang, R Jiao, SS Zhan, C Lang, C Huang… - arxiv preprint arxiv …, 2023 - arxiv.org
Autonomous Driving (AD) encounters significant safety hurdles in long-tail unforeseen
driving scenarios, largely stemming from the non-interpretability and poor generalization of …

Iterative reachability estimation for safe reinforcement learning

M Ganai, Z Gong, C Yu, S Herbert… - Advances in Neural …, 2023 - proceedings.neurips.cc
Ensuring safety is important for the practical deployment of reinforcement learning (RL).
Various challenges must be addressed, such as handling stochasticity in the environments …

Compositional policy learning in stochastic control systems with formal guarantees

Đ Žikelić, M Lechner, A Verma… - Advances in …, 2023 - proceedings.neurips.cc
Reinforcement learning has shown promising results in learning neural network policies for
complicated control tasks. However, the lack of formal guarantees about the behavior of …

Safe exploration in reinforcement learning: A generalized formulation and algorithms

A Wachi, W Hashimoto, X Shen… - Advances in Neural …, 2023 - proceedings.neurips.cc
Safe exploration is essential for the practical use of reinforcement learning (RL) in many real-
world scenarios. In this paper, we present a generalized safe exploration (GSE) problem as …

Polar-express: Efficient and precise formal reachability analysis of neural-network controlled systems

Y Wang, W Zhou, J Fan, Z Wang, J Li… - … on Computer-Aided …, 2023 - ieeexplore.ieee.org
Neural networks (NNs) playing the role of controllers have demonstrated impressive
empirical performance on challenging control problems. However, the potential adoption of …

State-wise safe reinforcement learning with pixel observations

S Zhan, Y Wang, Q Wu, R Jiao… - 6th Annual Learning …, 2024 - proceedings.mlr.press
In the context of safe exploration, Reinforcement Learning (RL) has long grappled with the
challenges of balancing the tradeoff between maximizing rewards and minimizing safety …

POLICEd RL: Learning closed-loop robot control policies with provable satisfaction of hard constraints

JB Bouvier, K Nagpal, N Mehr - arxiv preprint arxiv:2403.13297, 2024 - arxiv.org
In this paper, we seek to learn a robot policy guaranteed to satisfy state constraints. To
encourage constraint satisfaction, existing RL algorithms typically rely on Constrained …