Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey on offline reinforcement learning: Taxonomy, review, and open problems
RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …
experienced a dramatic increase in popularity, scaling to previously intractable problems …
Transferring policy of deep reinforcement learning from simulation to reality for robotics
H Ju, R Juan, R Gomez, K Nakamura… - Nature Machine …, 2022 - nature.com
Deep reinforcement learning has achieved great success in many fields and has shown
promise in learning robust skills for robot control in recent years. However, sampling …
promise in learning robust skills for robot control in recent years. However, sampling …
Efficient online reinforcement learning with offline data
Sample efficiency and exploration remain major challenges in online reinforcement learning
(RL). A powerful approach that can be applied to address these issues is the inclusion of …
(RL). A powerful approach that can be applied to address these issues is the inclusion of …
Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning
A compelling use case of offline reinforcement learning (RL) is to obtain a policy initialization
from existing datasets followed by fast online fine-tuning with limited interaction. However …
from existing datasets followed by fast online fine-tuning with limited interaction. However …
Online decision transformer
Recent work has shown that offline reinforcement learning (RL) can be formulated as a
sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via …
sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via …
Mildly conservative q-learning for offline reinforcement learning
Offline reinforcement learning (RL) defines the task of learning from a static logged dataset
without continually interacting with the environment. The distribution shift between the …
without continually interacting with the environment. The distribution shift between the …
Pessimistic bootstrap** for uncertainty-driven offline reinforcement learning
Offline Reinforcement Learning (RL) aims to learn policies from previously collected
datasets without exploring the environment. Directly applying off-policy algorithms to offline …
datasets without exploring the environment. Directly applying off-policy algorithms to offline …
Hybrid rl: Using both offline and online data can make rl efficient
We consider a hybrid reinforcement learning setting (Hybrid RL), in which an agent has
access to an offline dataset and the ability to collect experience via real-world online …
access to an offline dataset and the ability to collect experience via real-world online …
Chipformer: Transferable chip placement via offline decision transformer
Placement is a critical step in modern chip design, aiming to determine the positions of
circuit modules on the chip canvas. Recent works have shown that reinforcement learning …
circuit modules on the chip canvas. Recent works have shown that reinforcement learning …
Pre-training for robots: Offline rl enables learning new tasks from a handful of trials
Progress in deep learning highlights the tremendous potential of utilizing diverse robotic
datasets for attaining effective generalization and makes it enticing to consider leveraging …
datasets for attaining effective generalization and makes it enticing to consider leveraging …