Survival instinct in offline reinforcement learning

A Li, D Misra, A Kolobov… - Advances in neural …, 2024 - proceedings.neurips.cc
We present a novel observation about the behavior of offline reinforcement learning (RL)
algorithms: on many benchmark datasets, offline RL can produce well-performing and safe …

Robust fitted-q-evaluation and iteration under sequentially exogenous unobserved confounders

D Bruns-Smith, A Zhou - arxiv preprint arxiv:2302.00662, 2023 - arxiv.org
Offline reinforcement learning is important in domains such as medicine, economics, and e-
commerce where online experimentation is costly, dangerous or unethical, and where the …

On the Limited Representational Power of Value Functions and its Links to Statistical (In) Efficiency

D Cheikhi, D Russo - arxiv preprint arxiv:2403.07136, 2024 - arxiv.org
Identifying the trade-offs between model-based and model-free methods is a central
question in reinforcement learning. Value-based methods offer substantial computational …

[BUCH][B] Exploiting Structure in Learning: A Path Toward Building Safe and Adaptive Robots

A Li - 2023 - search.proquest.com
As robots venture into real-world applications, there is an increasing need for them to
effectively learn from experience and adapt to unseen situations. This thesis addresses a …

EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes

M Hargrave, A Spaeth, L Grosenick - The Thirty-eight Conference on … - openreview.net
Healthcare applications pose significant challenges to existing reinforcement learning (RL)
methods due to implementation risks, low data availability, short treatment episodes, sparse …

Survival Instinct in Offline Reinforcement Learning and Implicit Human Bias in Data

A Li, D Misra, A Kolobov, CA Cheng - openreview.net
We present a novel observation about the behavior of offline reinforcement learning (RL)
algorithms: on many benchmark datasets, offline RL can produce well-performing and safe …