Академия Google

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

Robust reinforcement learning: A review of foundations and recent advances

J Moos, K Hansel, H Abdulsamad, S Stark… - Machine Learning and …, 2022 - mdpi.com

Reinforcement learning (RL) has become a highly successful framework for learning in
Markov decision processes (MDP). Due to the adoption of RL in realistic and complex …

Сохранить Цитировать Цитируется: 143 Похожие статьи Все версии статьи (8) Сохраненная копия

[Free GPT-4]
[DeepSeek]

[PDF] numdam.org

Frameworks and results in distributionally robust optimization

H Rahimian, S Mehrotra - Open Journal of Mathematical Optimization, 2022 - numdam.org

The concepts of risk aversion, chance-constrained optimization, and robust optimization
have developed significantly over the last decade. The statistical learning community has …

Сохранить Цитировать Цитируется: 855 Похожие статьи Все версии статьи (13) Сохраненная копия

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reinforcement learning in healthcare: A survey

C Yu, J Liu, S Nemati, G Yin - ACM Computing Surveys (CSUR), 2021 - dl.acm.org

As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …

Сохранить Цитировать Цитируется: 784 Похожие статьи Все версии статьи (5)

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Robust deep reinforcement learning against adversarial perturbations on state observations

H Zhang, H Chen, C **ao, B Li, M Liu… - Advances in …, 2020 - proceedings.neurips.cc

A deep reinforcement learning (DRL) agent observes its states through observations, which
may contain natural measurement errors or adversarial noises. Since the observations …

Сохранить Цитировать Цитируется: 354 Похожие статьи Все версии статьи (9) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Robust reinforcement learning using offline data

K Panaganti, Z Xu, D Kalathil… - Advances in neural …, 2022 - proceedings.neurips.cc

The goal of robust reinforcement learning (RL) is to learn a policy that is robust against the
uncertainty in model parameters. Parameter uncertainty commonly occurs in many real …

Сохранить Цитировать Цитируется: 86 Похожие статьи Все версии статьи (8) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Policy gradient method for robust reinforcement learning

Y Wang, S Zou - International conference on machine …, 2022 - proceedings.mlr.press

This paper develops the first policy gradient method with global optimality guarantee and
complexity analysis for robust reinforcement learning under model mismatch. Robust …

Сохранить Цитировать Цитируется: 75 Похожие статьи Все версии статьи (8) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

The curious price of distributional robustness in reinforcement learning with a generative model

L Shi, G Li, Y Wei, Y Chen… - Advances in Neural …, 2023 - proceedings.neurips.cc

This paper investigates model robustness in reinforcement learning (RL) via the framework
of distributionally robust Markov decision processes (RMDPs). Despite recent efforts, the …

Сохранить Цитировать Цитируется: 39 Похожие статьи Все версии статьи (12) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Step: Stochastic traversability evaluation and planning for risk-aware off-road navigation

DD Fan, K Otsu, Y Kubo, A Dixit, J Burdick… - arxiv preprint arxiv …, 2021 - arxiv.org

Although ground robotic autonomy has gained widespread usage in structured and
controlled environments, autonomy in unknown and off-road terrain remains a difficult …

Сохранить Цитировать Цитируется: 140 Похожие статьи Все версии статьи (8) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Double pessimism is provably efficient for distributionally robust offline reinforcement learning: Generic algorithm and robust partial coverage

J Blanchet, M Lu, T Zhang… - Advances in Neural …, 2023 - proceedings.neurips.cc

We study distributionally robust offline reinforcement learning (RL), which seeks to find an
optimal robust policy purely from an offline dataset that can perform well in perturbed …

Сохранить Цитировать Цитируется: 38 Похожие статьи Все версии статьи (7) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Certainty equivalence is efficient for linear quadratic control

H Mania, S Tu, B Recht - Advances in Neural Information …, 2019 - proceedings.neurips.cc

We study the performance of the certainty equivalent controller on Linear Quadratic (LQ)
control problems with unknown transition dynamics. We show that for both the fully and …

Сохранить Цитировать Цитируется: 251 Похожие статьи Все версии статьи (8) В виде HTML

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Robust reinforcement learning: A review of foundations and recent advances

Frameworks and results in distributionally robust optimization

Reinforcement learning in healthcare: A survey

Robust deep reinforcement learning against adversarial perturbations on state observations

Robust reinforcement learning using offline data

Policy gradient method for robust reinforcement learning

The curious price of distributional robustness in reinforcement learning with a generative model

Step: Stochastic traversability evaluation and planning for risk-aware off-road navigation

Double pessimism is provably efficient for distributionally robust offline reinforcement learning: Generic algorithm and robust partial coverage

Certainty equivalence is efficient for linear quadratic control