A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z **ong, L Zintgraf… - arxiv preprint arxiv …, 2023 - arxiv.org
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

Causal multi-agent reinforcement learning: Review and open problems

SJ Grimbly, J Shock, A Pretorius - arxiv preprint arxiv:2111.06721, 2021 - arxiv.org
This paper serves to introduce the reader to the field of multi-agent reinforcement learning
(MARL) and its intersection with methods from the study of causality. We highlight key …

Deep reinforcement learning amidst continual structured non-stationarity

A **e, J Harrison, C Finn - International Conference on …, 2021 - proceedings.mlr.press
As humans, our goals and our environment are persistently changing throughout our lifetime
based on our experiences, actions, and internal and external drives. In contrast, typical …

On the effectiveness of fine-tuning versus meta-reinforcement learning

Z Mandi, P Abbeel, S James - arxiv preprint arxiv:2206.03271, 2022 - arxiv.org
Intelligent agents should have the ability to leverage knowledge from previously learned
tasks in order to learn new ones quickly and efficiently. Meta-learning approaches have …

Model-based meta reinforcement learning using graph structured surrogate models and amortized policy search

Q Wang, H Van Hoof - International Conference on Machine …, 2022 - proceedings.mlr.press
Reinforcement learning is a promising paradigm for solving sequential decision-making
problems, but low data efficiency and weak generalization across tasks are bottlenecks in …

Test-time adaptation via self-training with nearest neighbor information

M Jang, SY Chung, HW Chung - arxiv preprint arxiv:2207.10792, 2022 - arxiv.org
Test-time adaptation (TTA) aims to adapt a trained classifier using online unlabeled test data
only, without any information related to the training procedure. Most existing TTA methods …

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

R Zhou, CX Gao, Z Zhang, Y Yu - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Generalization and sample efficiency have been long-standing issues concerning
reinforcement learning, and thus the field of Offline Meta-Reinforcement Learning (OMRL) …

Model-based adversarial meta-reinforcement learning

Z Lin, G Thomas, G Yang, T Ma - Advances in Neural …, 2020 - proceedings.neurips.cc
Meta-reinforcement learning (meta-RL) aims to learn from multiple training tasks the ability
to adapt efficiently to unseen test tasks. Despite the success, existing meta-RL algorithms …

On the effectiveness of fine-tuning versus meta-reinforcement learning

M Zhao, P Abbeel, S James - Advances in neural …, 2022 - proceedings.neurips.cc
Intelligent agents should have the ability to leverage knowledge from previously learned
tasks in order to learn new ones quickly and efficiently. Meta-learning approaches have …

Offline meta reinforcement learning with in-distribution online adaptation

J Wang, J Zhang, H Jiang, J Zhang… - International …, 2023 - proceedings.mlr.press
Recent offline meta-reinforcement learning (meta-RL) methods typically utilize task-
dependent behavior policies (eg, training RL agents on each individual task) to collect a …