Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Robust reinforcement learning: A review of foundations and recent advances
Reinforcement learning (RL) has become a highly successful framework for learning in
Markov decision processes (MDP). Due to the adoption of RL in realistic and complex …
Markov decision processes (MDP). Due to the adoption of RL in realistic and complex …
Frameworks and results in distributionally robust optimization
The concepts of risk aversion, chance-constrained optimization, and robust optimization
have developed significantly over the last decade. The statistical learning community has …
have developed significantly over the last decade. The statistical learning community has …
Reinforcement learning in healthcare: A survey
As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …
making by using interaction samples of an agent with its environment and the potentially …
Robust deep reinforcement learning against adversarial perturbations on state observations
A deep reinforcement learning (DRL) agent observes its states through observations, which
may contain natural measurement errors or adversarial noises. Since the observations …
may contain natural measurement errors or adversarial noises. Since the observations …
Robust reinforcement learning using offline data
The goal of robust reinforcement learning (RL) is to learn a policy that is robust against the
uncertainty in model parameters. Parameter uncertainty commonly occurs in many real …
uncertainty in model parameters. Parameter uncertainty commonly occurs in many real …
Policy gradient method for robust reinforcement learning
This paper develops the first policy gradient method with global optimality guarantee and
complexity analysis for robust reinforcement learning under model mismatch. Robust …
complexity analysis for robust reinforcement learning under model mismatch. Robust …
The curious price of distributional robustness in reinforcement learning with a generative model
L Shi, G Li, Y Wei, Y Chen… - Advances in Neural …, 2023 - proceedings.neurips.cc
This paper investigates model robustness in reinforcement learning (RL) via the framework
of distributionally robust Markov decision processes (RMDPs). Despite recent efforts, the …
of distributionally robust Markov decision processes (RMDPs). Despite recent efforts, the …
Step: Stochastic traversability evaluation and planning for risk-aware off-road navigation
Although ground robotic autonomy has gained widespread usage in structured and
controlled environments, autonomy in unknown and off-road terrain remains a difficult …
controlled environments, autonomy in unknown and off-road terrain remains a difficult …
Double pessimism is provably efficient for distributionally robust offline reinforcement learning: Generic algorithm and robust partial coverage
J Blanchet, M Lu, T Zhang… - Advances in Neural …, 2023 - proceedings.neurips.cc
We study distributionally robust offline reinforcement learning (RL), which seeks to find an
optimal robust policy purely from an offline dataset that can perform well in perturbed …
optimal robust policy purely from an offline dataset that can perform well in perturbed …
Certainty equivalence is efficient for linear quadratic control
We study the performance of the certainty equivalent controller on Linear Quadratic (LQ)
control problems with unknown transition dynamics. We show that for both the fully and …
control problems with unknown transition dynamics. We show that for both the fully and …