A survey on offline reinforcement learning: Taxonomy, review, and open problems
RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …
experienced a dramatic increase in popularity, scaling to previously intractable problems …
A minimalist approach to offline reinforcement learning
Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …
Critic regularized regression
Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …
optimization from large pre-recorded datasets without online environment interaction. It …
d3rlpy: An offline deep reinforcement learning library
In this paper, we introduce d3rlpy, an open-sourced offline deep reinforcement learning (RL)
library for Python. d3rlpy supports a set of offline deep RL algorithms as well as off-policy …
library for Python. d3rlpy supports a set of offline deep RL algorithms as well as off-policy …
An optimistic perspective on offline reinforcement learning
Off-policy reinforcement learning (RL) using a fixed offline dataset of logged interactions is
an important consideration in real world applications. This paper studies offline RL using the …
an important consideration in real world applications. This paper studies offline RL using the …
A theoretical analysis of deep Q-learning
Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …
foundation is less well understood. In this work, we make the first attempt to theoretically …
Offline rl without off-policy evaluation
Most prior approaches to offline reinforcement learning (RL) have taken an iterative actor-
critic approach involving off-policy evaluation. In this paper we show that simply doing one …
critic approach involving off-policy evaluation. In this paper we show that simply doing one …
Rl unplugged: A suite of benchmarks for offline reinforcement learning
Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …
reinforcement learning research and real-world applications. They make it possible to learn …
Alleviating matthew effect of offline reinforcement learning in interactive recommendation
Offline reinforcement learning (RL), a technology that offline learns a policy from logged data
without the need to interact with online environments, has become a favorable choice in …
without the need to interact with online environments, has become a favorable choice in …
Model selection for offline reinforcement learning: Practical considerations for healthcare settings
Reinforcement learning (RL) can be used to learn treatment policies and aid decision
making in healthcare. However, given the need for generalization over complex state/action …
making in healthcare. However, given the need for generalization over complex state/action …