- Academic Search

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Save Cite Cited by 297 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] jmlr.org

[PDF][PDF] A comprehensive survey on safe reinforcement learning

J Garcıa, F Fernández - Journal of Machine Learning Research, 2015 - jmlr.org

Abstract Safe Reinforcement Learning can be defined as the process of learning policies
that maximize the expectation of the return in problems in which it is important to ensure …

Save Cite Cited by 2092 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Robust reinforcement learning using offline data

K Panaganti, Z Xu, D Kalathil… - Advances in neural …, 2022 - proceedings.neurips.cc

The goal of robust reinforcement learning (RL) is to learn a policy that is robust against the
uncertainty in model parameters. Parameter uncertainty commonly occurs in many real …

Save Cite Cited by 88 Related articles All 8 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] jmlr.org

Risk-constrained reinforcement learning with percentile risk criteria

Y Chow, M Ghavamzadeh, L Janson… - Journal of Machine …, 2018 - jmlr.org

In many sequential decision-making problems one is interested in minimizing an expected
cumulative cost while taking into account risk, ie, increased awareness of events of small …

Save Cite Cited by 628 Related articles All 12 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Sample complexity of robust reinforcement learning with a generative model

K Panaganti, D Kalathil - International Conference on …, 2022 - proceedings.mlr.press

Abstract The Robust Markov Decision Process (RMDP) framework focuses on designing
control policies that are robust against the parameter uncertainties due to the mismatches …

Save Cite Cited by 85 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Algorithms for CVaR optimization in MDPs

Y Chow, M Ghavamzadeh - Advances in neural information …, 2014 - proceedings.neurips.cc

In many sequential decision-making problems we may want to manage risk by minimizing
some measure of variability in costs in addition to minimizing a standard criterion …

Save Cite Cited by 407 Related articles All 11 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] kcl.ac.uk

A review of safe reinforcement learning: Methods, theories and applications

S Gu, L Yang, Y Du, G Chen, F Walter… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Save Cite Cited by 13 Related articles All 8 versions Free GPT-4

[Free GPT-4]

[PDF] neurips.cc

Actor-critic algorithms for risk-sensitive MDPs

P La, M Ghavamzadeh - Advances in neural information …, 2013 - proceedings.neurips.cc

In many sequential decision-making problems we may want to manage risk by minimizing
some measure of variability in rewards in addition to maximizing a standard criterion …

Save Cite Cited by 338 Related articles All 28 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Exponential bellman equation and improved regret bounds for risk-sensitive reinforcement learning

Y Fei, Z Yang, Y Chen, Z Wang - Advances in neural …, 2021 - proceedings.neurips.cc

We study risk-sensitive reinforcement learning (RL) based on the entropic risk measure.
Although existing works have established non-asymptotic regret guarantees for this …

Save Cite Cited by 67 Related articles All 9 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] jair.org

Risk-sensitive reinforcement learning applied to control under constraints

P Geibel, F Wysotzki - Journal of Artificial Intelligence Research, 2005 - jair.org

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states
are those states entering which is undesirable or dangerous. We define the risk with respect …

Save Cite Cited by 449 Related articles All 19 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Q-learning for risk-sensitive control

A review of safe reinforcement learning: Methods, theory and applications

[PDF][PDF] A comprehensive survey on safe reinforcement learning

Robust reinforcement learning using offline data

Risk-constrained reinforcement learning with percentile risk criteria

Sample complexity of robust reinforcement learning with a generative model

Algorithms for CVaR optimization in MDPs

A review of safe reinforcement learning: Methods, theories and applications

Actor-critic algorithms for risk-sensitive MDPs

Exponential bellman equation and improved regret bounds for risk-sensitive reinforcement learning

Risk-sensitive reinforcement learning applied to control under constraints