A survey on offline reinforcement learning: Taxonomy, review, and open problems

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

A minimalist approach to offline reinforcement learning

S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc
Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel… - Advances in …, 2020 - proceedings.neurips.cc
Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

d3rlpy: An offline deep reinforcement learning library

T Seno, M Imai - Journal of Machine Learning Research, 2022 - jmlr.org
In this paper, we introduce d3rlpy, an open-sourced offline deep reinforcement learning (RL)
library for Python. d3rlpy supports a set of offline deep RL algorithms as well as off-policy …

An optimistic perspective on offline reinforcement learning

R Agarwal, D Schuurmans… - … conference on machine …, 2020 - proceedings.mlr.press
Off-policy reinforcement learning (RL) using a fixed offline dataset of logged interactions is
an important consideration in real world applications. This paper studies offline RL using the …

A theoretical analysis of deep Q-learning

J Fan, Z Wang, Y **e, Z Yang - Learning for dynamics and …, 2020 - proceedings.mlr.press
Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …

Offline rl without off-policy evaluation

D Brandfonbrener, W Whitney… - Advances in neural …, 2021 - proceedings.neurips.cc
Most prior approaches to offline reinforcement learning (RL) have taken an iterative actor-
critic approach involving off-policy evaluation. In this paper we show that simply doing one …

Rl unplugged: A suite of benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - Advances in …, 2020 - proceedings.neurips.cc
Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

Alleviating matthew effect of offline reinforcement learning in interactive recommendation

C Gao, K Huang, J Chen, Y Zhang, B Li… - Proceedings of the 46th …, 2023 - dl.acm.org
Offline reinforcement learning (RL), a technology that offline learns a policy from logged data
without the need to interact with online environments, has become a favorable choice in …

Model selection for offline reinforcement learning: Practical considerations for healthcare settings

S Tang, J Wiens - Machine Learning for Healthcare …, 2021 - proceedings.mlr.press
Reinforcement learning (RL) can be used to learn treatment policies and aid decision
making in healthcare. However, given the need for generalization over complex state/action …