- Academic Search

Towards effective context for meta-reinforcement learning: an approach based on contrastive learning

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Cited by 335

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org

While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

Cited by 167

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

R Zhou, CX Gao, Z Zhang, Y Yu - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

Generalization and sample efficiency have been long-standing issues concerning
reinforcement learning, and thus the field of Offline Meta-Reinforcement Learning (OMRL) …

Cited by 12

Context shift reduction for offline meta-reinforcement learning

Y Gao, R Zhang, J Guo, F Wu, Q Yi… - Advances in …, 2024 - proceedings.neurips.cc

Offline meta-reinforcement learning (OMRL) utilizes pre-collected offline datasets to
enhance the agent's generalization ability on unseen tasks. However, the context shift …

Cited by 11

Efficient symbolic policy learning with differentiable symbolic expression

J Guo, R Zhang, S Peng, Q Yi, X Hu… - Advances in …, 2024 - proceedings.neurips.cc

Deep reinforcement learning (DRL) has led to a wide range of advances in sequential
decision-making tasks. However, the complexity of neural network policies makes it difficult …

Cited by 4

Contextualize Me--The Case for Context in Reinforcement Learning

C Benjamins, T Eimer, F Schubert, A Mohan… - arXiv preprint arXiv …, 2022 - arxiv.org

While Reinforcement Learning (RL) has made great strides towards solving increasingly
complicated problems, many algorithms are still brittle to even slight environmental changes …

Cited by 40

A contrastive-enhanced ensemble framework for efficient multi-agent reinforcement learning

X Du, H Chen, Y Xing, PS Yu, L He - Expert Systems with Applications, 2024 - Elsevier

Multi-agent reinforcement learning is promising for real-world applications as it encourages
agents to perceive and interact with their surrounding environment autonomously. However …

Cited by 3

First-explore, then exploit: Meta-learning intelligent exploration

B Norman, J Clune - arXiv preprint arXiv:2307.02276, 2023 - thetalkingmachines.com

Standard reinforcement learning (RL) agents never intelligently explore like a human (i.e., by
taking into account complex domain priors and previous explorations). Even the most basic …

Cited by 8

Contrabar: Contrastive bayes-adaptive deep rl

E Choshen, A Tamar - International Conference on Machine …, 2023 - proceedings.mlr.press

In meta reinforcement learning (meta RL), an agent seeks a Bayes-optimal policy–the
optimal policy when facing an unknown task that is sampled from some known task …

Cited by 6

Domino: Decomposed mutual information optimization for generalized context in meta-reinforcement learning

Y Mu, Y Zhuang, F Ni, B Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc

Adapting to the changes in transition dynamics is essential in robotic applications. By
learning a conditional policy with a compact context, context-aware meta-reinforcement …

Cited by 12
