Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Decision transformer: Reinforcement learning via sequence modeling

L Chen, K Lu, A Rajeswaran, K Lee… - Advances in neural …, 2021 - proceedings.neurips.cc
We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence
modeling problem. This allows us to draw upon the simplicity and scalability of the …

Online decision transformer

Q Zheng, A Zhang, A Grover - international conference on …, 2022 - proceedings.mlr.press
Recent work has shown that offline reinforcement learning (RL) can be formulated as a
sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via …

How to reuse and compose knowledge for a lifetime of tasks: A survey on continual learning and functional composition

JA Mendez, E Eaton - arxiv preprint arxiv:2207.07730, 2022 - arxiv.org
A major goal of artificial intelligence (AI) is to create an agent capable of acquiring a general
understanding of the world. Such an agent would require the ability to continually …

Transdreamer: Reinforcement learning with transformer world models

C Chen, YF Wu, J Yoon, S Ahn - arxiv preprint arxiv:2202.09481, 2022 - arxiv.org
The Dreamer agent provides various benefits of Model-Based Reinforcement Learning
(MBRL) such as sample efficiency, reusable knowledge, and safe planning. However, its …

Fastrlap: A system for learning high-speed driving via deep rl and autonomous practicing

K Stachowicz, D Shah, A Bhorkar… - … on Robot Learning, 2023 - proceedings.mlr.press
We present a system that enables an autonomous small-scale RC car to drive aggressively
from visual observations using reinforcement learning (RL). Our system, FastRLAP, trains …

Don't start from scratch: Leveraging prior data to automate robotic reinforcement learning

HR Walke, JH Yang, A Yu, A Kumar… - … on Robot Learning, 2023 - proceedings.mlr.press
Reinforcement learning (RL) algorithms hold the promise of enabling autonomous skill
acquisition for robotic systems. However, in practice, real-world robotic RL typically requires …

Skill-based model-based reinforcement learning

LX Shi, JJ Lim, Y Lee - arxiv preprint arxiv:2207.07560, 2022 - arxiv.org
Model-based reinforcement learning (RL) is a sample-efficient way of learning complex
behaviors by leveraging a learned single-step dynamics model to plan actions in …

Robot fine-tuning made easy: Pre-training rewards and policies for autonomous real-world reinforcement learning

J Yang, MS Mark, B Vu, A Sharma… - … on Robotics and …, 2024 - ieeexplore.ieee.org
The pre-train and fine-tune paradigm in machine learning has had dramatic success in a
wide range of domains because the use of existing data or pre-trained models on the …

Practice makes perfect: Planning to learn skill parameter policies

N Kumar, T Silver, W McClinton, L Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
One promising approach towards effective robot decision making in complex, long-horizon
tasks is to sequence together parameterized skills. We consider a setting where a robot is …