Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z **ong, L Zintgraf… - arxiv preprint arxiv …, 2023 - arxiv.org
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

R Zhou, CX Gao, Z Zhang, Y Yu - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Generalization and sample efficiency have been long-standing issues concerning
reinforcement learning, and thus the field of Offline Meta-Reinforcement Learning (OMRL) …

Context shift reduction for offline meta-reinforcement learning

Y Gao, R Zhang, J Guo, F Wu, Q Yi… - Advances in …, 2024 - proceedings.neurips.cc
Offline meta-reinforcement learning (OMRL) utilizes pre-collected offline datasets to
enhance the agent's generalization ability on unseen tasks. However, the context shift …

Efficient symbolic policy learning with differentiable symbolic expression

J Guo, R Zhang, S Peng, Q Yi, X Hu… - Advances in …, 2024 - proceedings.neurips.cc
Deep reinforcement learning (DRL) has led to a wide range of advances in sequential
decision-making tasks. However, the complexity of neural network policies makes it difficult …

Contextualize Me--The Case for Context in Reinforcement Learning

C Benjamins, T Eimer, F Schubert, A Mohan… - arxiv preprint arxiv …, 2022 - arxiv.org
While Reinforcement Learning (RL) has made great strides towards solving increasingly
complicated problems, many algorithms are still brittle to even slight environmental changes …

A contrastive-enhanced ensemble framework for efficient multi-agent reinforcement learning

X Du, H Chen, Y **ng, SY Philip, L He - Expert Systems with Applications, 2024 - Elsevier
Multi-agent reinforcement learning is promising for real-world applications as it encourages
agents to perceive and interact with their surrounding environment autonomously. However …

[PDF][PDF] First-explore, then exploit: Meta-learning intelligent exploration

B Norman, J Clune - arxiv preprint arxiv:2307.02276, 2023 - thetalkingmachines.com
Standard reinforcement learning (RL) agents never intelligently explore like a human (ie by
taking into account complex domain priors and previous explorations). Even the most basic …

Contrabar: Contrastive bayes-adaptive deep rl

E Choshen, A Tamar - International Conference on Machine …, 2023 - proceedings.mlr.press
In meta reinforcement learning (meta RL), an agent seeks a Bayes-optimal policy–the
optimal policy when facing an unknown task that is sampled from some known task …

Domino: Decomposed mutual information optimization for generalized context in meta-reinforcement learning

Y Mu, Y Zhuang, F Ni, B Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc
Adapting to the changes in transition dynamics is essential in robotic applications. By
learning a conditional policy with a compact context, context-aware meta-reinforcement …