Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z **ong, L Zintgraf… - arxiv preprint arxiv …, 2023 - arxiv.org
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

How to reuse and compose knowledge for a lifetime of tasks: A survey on continual learning and functional composition

JA Mendez, E Eaton - arxiv preprint arxiv:2207.07730, 2022 - arxiv.org
A major goal of artificial intelligence (AI) is to create an agent capable of acquiring a general
understanding of the world. Such an agent would require the ability to continually …

Bootstrap your own skills: Learning to solve new tasks with large language model guidance

J Zhang, J Zhang, K Pertsch, Z Liu, X Ren… - arxiv preprint arxiv …, 2023 - arxiv.org
We propose BOSS, an approach that automatically learns to solve new long-horizon,
complex, and meaningful tasks by growing a learned skill library with minimal supervision …

Hyper-decision transformer for efficient online policy adaptation

M Xu, Y Lu, Y Shen, S Zhang, D Zhao… - arxiv preprint arxiv …, 2023 - arxiv.org
Decision Transformers (DT) have demonstrated strong performances in offline reinforcement
learning settings, but quickly adapting to unseen novel tasks remains challenging. To …

Learning to discover skills through guidance

H Kim, BK Lee, H Lee, D Hwang… - Advances in …, 2024 - proceedings.neurips.cc
In the field of unsupervised skill discovery (USD), a major challenge is limited exploration,
primarily due to substantial penalties when skills deviate from their initial trajectories. To …

Learning options via compression

Y Jiang, E Liu, B Eysenbach… - Advances in Neural …, 2022 - proceedings.neurips.cc
Identifying statistical regularities in solutions to some tasks in multi-task reinforcement
learning can accelerate the learning of new tasks. Skill learning offers one way of identifying …

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arxiv preprint arxiv:2306.16021, 2023 - academia.edu
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Sprint: Scalable policy pre-training via language instruction relabeling

J Zhang, K Pertsch, J Zhang… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Pre-training robots with a rich set of skills can substantially accelerate the learning of
downstream tasks. Prior works have defined pre-training tasks via natural language …

Flow to control: Offline reinforcement learning with lossless primitive discovery

Y Yang, H Hu, W Li, S Li, J Yang, Q Zhao… - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Offline reinforcement learning (RL) enables the agent to effectively learn from logged data,
which significantly extends the applicability of RL algorithms in real-world scenarios where …