Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey on offline reinforcement learning: Taxonomy, review, and open problems
RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …
experienced a dramatic increase in popularity, scaling to previously intractable problems …
A minimalist approach to offline reinforcement learning
Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …
What matters in learning from offline human demonstrations for robot manipulation
Imitating human demonstrations is a promising approach to endow robots with various
manipulation capabilities. While recent advances have been made in imitation learning and …
manipulation capabilities. While recent advances have been made in imitation learning and …
Rambo-rl: Robust adversarial model-based offline reinforcement learning
Offline reinforcement learning (RL) aims to find performant policies from logged data without
further environment interaction. Model-based algorithms, which learn a model of the …
further environment interaction. Model-based algorithms, which learn a model of the …
Offline-to-online reinforcement learning via balanced replay and pessimistic q-ensemble
Recent advance in deep offline reinforcement learning (RL) has made it possible to train
strong robotic agents from offline datasets. However, depending on the quality of the trained …
strong robotic agents from offline datasets. However, depending on the quality of the trained …
Rvs: What is essential for offline rl via supervised learning?
Recent work has shown that supervised learning alone, without temporal difference (TD)
learning, can be remarkably effective for offline RL. When does this hold true, and which …
learning, can be remarkably effective for offline RL. When does this hold true, and which …
Adversarially trained actor critic for offline reinforcement learning
Abstract We propose Adversarially Trained Actor Critic (ATAC), a new model-free algorithm
for offline reinforcement learning (RL) under insufficient data coverage, based on the …
for offline reinforcement learning (RL) under insufficient data coverage, based on the …
Q-learning decision transformer: Leveraging dynamic programming for conditional sequence modelling in offline rl
Recent works have shown that tackling offline reinforcement learning (RL) with a conditional
policy produces promising results. The Decision Transformer (DT) combines the conditional …
policy produces promising results. The Decision Transformer (DT) combines the conditional …
Acme: A research framework for distributed reinforcement learning
Deep reinforcement learning (RL) has led to many recent and groundbreaking advances.
However, these advances have often come at the cost of both increased scale in the …
However, these advances have often come at the cost of both increased scale in the …
A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings
Reinforcement learning (RL) has been shown to have the potential for optimal control of
heating, ventilation, and air conditioning (HVAC) systems. Although research on RL-based …
heating, ventilation, and air conditioning (HVAC) systems. Although research on RL-based …