Robust reinforcement learning: A review of foundations and recent advances

J Moos, K Hansel, H Abdulsamad, S Stark… - Machine Learning and …, 2022 - mdpi.com
Reinforcement learning (RL) has become a highly successful framework for learning in
Markov decision processes (MDP). Due to the adoption of RL in realistic and complex …

Frameworks and results in distributionally robust optimization

H Rahimian, S Mehrotra - Open Journal of Mathematical Optimization, 2022 - numdam.org
The concepts of risk aversion, chance-constrained optimization, and robust optimization
have developed significantly over the last decade. The statistical learning community has …

Reinforcement learning in healthcare: A survey

C Yu, J Liu, S Nemati, G Yin - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …

Robust deep reinforcement learning against adversarial perturbations on state observations

H Zhang, H Chen, C **ao, B Li, M Liu… - Advances in …, 2020 - proceedings.neurips.cc
A deep reinforcement learning (DRL) agent observes its states through observations, which
may contain natural measurement errors or adversarial noises. Since the observations …

Robust reinforcement learning using offline data

K Panaganti, Z Xu, D Kalathil… - Advances in neural …, 2022 - proceedings.neurips.cc
The goal of robust reinforcement learning (RL) is to learn a policy that is robust against the
uncertainty in model parameters. Parameter uncertainty commonly occurs in many real …

Policy gradient method for robust reinforcement learning

Y Wang, S Zou - International conference on machine …, 2022 - proceedings.mlr.press
This paper develops the first policy gradient method with global optimality guarantee and
complexity analysis for robust reinforcement learning under model mismatch. Robust …

The curious price of distributional robustness in reinforcement learning with a generative model

L Shi, G Li, Y Wei, Y Chen… - Advances in Neural …, 2023 - proceedings.neurips.cc
This paper investigates model robustness in reinforcement learning (RL) via the framework
of distributionally robust Markov decision processes (RMDPs). Despite recent efforts, the …

Step: Stochastic traversability evaluation and planning for risk-aware off-road navigation

DD Fan, K Otsu, Y Kubo, A Dixit, J Burdick… - arxiv preprint arxiv …, 2021 - arxiv.org
Although ground robotic autonomy has gained widespread usage in structured and
controlled environments, autonomy in unknown and off-road terrain remains a difficult …

Double pessimism is provably efficient for distributionally robust offline reinforcement learning: Generic algorithm and robust partial coverage

J Blanchet, M Lu, T Zhang… - Advances in Neural …, 2023 - proceedings.neurips.cc
We study distributionally robust offline reinforcement learning (RL), which seeks to find an
optimal robust policy purely from an offline dataset that can perform well in perturbed …

Certainty equivalence is efficient for linear quadratic control

H Mania, S Tu, B Recht - Advances in Neural Information …, 2019 - proceedings.neurips.cc
We study the performance of the certainty equivalent controller on Linear Quadratic (LQ)
control problems with unknown transition dynamics. We show that for both the fully and …