- Academic Search

A Li, D Misra, A Kolobov… - Advances in neural …, 2024 - proceedings.neurips.cc

We present a novel observation about the behavior of offline reinforcement learning (RL)
algorithms: on many benchmark datasets, offline RL can produce well-performing and safe …

Speichern Zitieren Zitiert von: 20 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Robust fitted-q-evaluation and iteration under sequentially exogenous unobserved confounders

D Bruns-Smith, A Zhou - arxiv preprint arxiv:2302.00662, 2023 - arxiv.org

Offline reinforcement learning is important in domains such as medicine, economics, and e-
commerce where online experimentation is costly, dangerous or unethical, and where the …

Speichern Zitieren Zitiert von: 13 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

On the Limited Representational Power of Value Functions and its Links to Statistical (In) Efficiency

D Cheikhi, D Russo - arxiv preprint arxiv:2403.07136, 2024 - arxiv.org

Identifying the trade-offs between model-based and model-free methods is a central
question in reinforcement learning. Value-based methods offer substantial computational …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen HTML-Version

[Free GPT-4]

[PDF] washington.edu

[BUCH][B] Exploiting Structure in Learning: A Path Toward Building Safe and Adaptive Robots

A Li - 2023 - search.proquest.com

As robots venture into real-world applications, there is an increasing need for them to
effectively learn from experience and adapt to unseen situations. This thesis addresses a …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen Bibliothekssuche

[Free GPT-4]

[PDF] openreview.net

EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes

M Hargrave, A Spaeth, L Grosenick - The Thirty-eight Conference on … - openreview.net

Healthcare applications pose significant challenges to existing reinforcement learning (RL)
methods due to implementation risks, low data availability, short treatment episodes, sparse …

Speichern Zitieren Ähnliche Artikel HTML-Version

[Free GPT-4]

[PDF] openreview.net

Survival Instinct in Offline Reinforcement Learning and Implicit Human Bias in Data

A Li, D Misra, A Kolobov, CA Cheng - openreview.net

We present a novel observation about the behavior of offline reinforcement learning (RL)
algorithms: on many benchmark datasets, offline RL can produce well-performing and safe …

Speichern Zitieren Ähnliche Artikel HTML-Version

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Survival instinct in offline reinforcement learning

Robust fitted-q-evaluation and iteration under sequentially exogenous unobserved confounders

On the Limited Representational Power of Value Functions and its Links to Statistical (In) Efficiency

[BUCH][B] Exploiting Structure in Learning: A Path Toward Building Safe and Adaptive Robots

EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes

Survival Instinct in Offline Reinforcement Learning and Implicit Human Bias in Data