Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org
The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

A domain-agnostic approach for characterization of lifelong learning systems

MM Baker, A New, M Aguilar-Simon, Z Al-Halah… - Neural Networks, 2023 - Elsevier
Despite the advancement of machine learning techniques in recent years, state-of-the-art
systems lack robustness to “real world” events, where the input distributions and tasks …

Deep Reinforcement learning for resilient power and energy systems: Progress, prospects, and future avenues

M Gautam - Electricity, 2023 - mdpi.com
In recent years, deep reinforcement learning (DRL) has garnered substantial attention in the
context of enhancing resilience in power and energy systems. Resilience, characterized by …

Building a subspace of policies for scalable continual learning

JB Gaya, T Doan, L Caccia, L Soulier… - arxiv preprint arxiv …, 2022 - arxiv.org
The ability to continuously acquire new knowledge and skills is crucial for autonomous
agents. Existing methods are typically based on either fixed-size models that struggle to …

Lifelong reinforcement learning with modulating masks

E Ben-Iwhiwhu, S Nath, PK Pilly, S Kolouri… - arxiv preprint arxiv …, 2022 - arxiv.org
Lifelong learning aims to create AI systems that continuously and incrementally learn during
a lifetime, similar to biological learning. Attempts so far have met problems, including …

COOM: a game benchmark for continual reinforcement learning

T Tomilin, M Fang, Y Zhang… - Advances in Neural …, 2024 - proceedings.neurips.cc
The advancement of continual reinforcement learning (RL) has been facing various
obstacles, including standardized metrics and evaluation protocols, demanding …

Efficient online reinforcement learning fine-tuning need not retain offline data

Z Zhou, A Peng, Q Li, S Levine, A Kumar - arxiv preprint arxiv:2412.07762, 2024 - arxiv.org
The modern paradigm in machine learning involves pre-training on diverse data, followed
by task-specific fine-tuning. In reinforcement learning (RL), this translates to learning via …

Dynamic dialogue policy for continual reinforcement learning

C Geishauser, C van Niekerk, N Lubis, M Heck… - arxiv preprint arxiv …, 2022 - arxiv.org
Continual learning is one of the key components of human learning and a necessary
requirement of artificial intelligence. As dialogue can potentially span infinitely many topics …

Learning with an Open Horizon in Ever-Changing Dialogue Circumstances

C Geishauser, C van Niekerk, N Lubis… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Task-orienteddialogue systems aid users in achieving their goals for specific tasks, eg,
booking a hotel room or managing a schedule. The systems experience various changes …