A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

[PDF][PDF] A comprehensive survey on safe reinforcement learning

J Garcıa, F Fernández - Journal of Machine Learning Research, 2015 - jmlr.org
Abstract Safe Reinforcement Learning can be defined as the process of learning policies
that maximize the expectation of the return in problems in which it is important to ensure …

Robust reinforcement learning using offline data

K Panaganti, Z Xu, D Kalathil… - Advances in neural …, 2022 - proceedings.neurips.cc
The goal of robust reinforcement learning (RL) is to learn a policy that is robust against the
uncertainty in model parameters. Parameter uncertainty commonly occurs in many real …

Risk-constrained reinforcement learning with percentile risk criteria

Y Chow, M Ghavamzadeh, L Janson… - Journal of Machine …, 2018 - jmlr.org
In many sequential decision-making problems one is interested in minimizing an expected
cumulative cost while taking into account risk, ie, increased awareness of events of small …

Sample complexity of robust reinforcement learning with a generative model

K Panaganti, D Kalathil - International Conference on …, 2022 - proceedings.mlr.press
Abstract The Robust Markov Decision Process (RMDP) framework focuses on designing
control policies that are robust against the parameter uncertainties due to the mismatches …

Algorithms for CVaR optimization in MDPs

Y Chow, M Ghavamzadeh - Advances in neural information …, 2014 - proceedings.neurips.cc
In many sequential decision-making problems we may want to manage risk by minimizing
some measure of variability in costs in addition to minimizing a standard criterion …

A review of safe reinforcement learning: Methods, theories and applications

S Gu, L Yang, Y Du, G Chen, F Walter… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Actor-critic algorithms for risk-sensitive MDPs

P La, M Ghavamzadeh - Advances in neural information …, 2013 - proceedings.neurips.cc
In many sequential decision-making problems we may want to manage risk by minimizing
some measure of variability in rewards in addition to maximizing a standard criterion …

Exponential bellman equation and improved regret bounds for risk-sensitive reinforcement learning

Y Fei, Z Yang, Y Chen, Z Wang - Advances in neural …, 2021 - proceedings.neurips.cc
We study risk-sensitive reinforcement learning (RL) based on the entropic risk measure.
Although existing works have established non-asymptotic regret guarantees for this …

Risk-sensitive reinforcement learning applied to control under constraints

P Geibel, F Wysotzki - Journal of Artificial Intelligence Research, 2005 - jair.org
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states
are those states entering which is undesirable or dangerous. We define the risk with respect …