Survival instinct in offline reinforcement learning
We present a novel observation about the behavior of offline reinforcement learning (RL)
algorithms: on many benchmark datasets, offline RL can produce well-performing and safe …
algorithms: on many benchmark datasets, offline RL can produce well-performing and safe …
Robust fitted-q-evaluation and iteration under sequentially exogenous unobserved confounders
Offline reinforcement learning is important in domains such as medicine, economics, and e-
commerce where online experimentation is costly, dangerous or unethical, and where the …
commerce where online experimentation is costly, dangerous or unethical, and where the …
On the Limited Representational Power of Value Functions and its Links to Statistical (In) Efficiency
Identifying the trade-offs between model-based and model-free methods is a central
question in reinforcement learning. Value-based methods offer substantial computational …
question in reinforcement learning. Value-based methods offer substantial computational …
[BUCH][B] Exploiting Structure in Learning: A Path Toward Building Safe and Adaptive Robots
A Li - 2023 - search.proquest.com
As robots venture into real-world applications, there is an increasing need for them to
effectively learn from experience and adapt to unseen situations. This thesis addresses a …
effectively learn from experience and adapt to unseen situations. This thesis addresses a …
EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes
Healthcare applications pose significant challenges to existing reinforcement learning (RL)
methods due to implementation risks, low data availability, short treatment episodes, sparse …
methods due to implementation risks, low data availability, short treatment episodes, sparse …
Survival Instinct in Offline Reinforcement Learning and Implicit Human Bias in Data
We present a novel observation about the behavior of offline reinforcement learning (RL)
algorithms: on many benchmark datasets, offline RL can produce well-performing and safe …
algorithms: on many benchmark datasets, offline RL can produce well-performing and safe …