Transfer learning in deep reinforcement learning: A survey

Z Zhu, K Lin, AK Jain, J Zhou - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org
Reinforcement learning is a learning paradigm for solving sequential decision-making
problems. Recent years have witnessed remarkable progress in reinforcement learning …

A tutorial on sparse Gaussian processes and variational inference

F Leibfried, V Dutordoir, ST John… - arxiv preprint arxiv …, 2020 - arxiv.org
Gaussian processes (GPs) provide a framework for Bayesian inference that can offer
principled uncertainty estimates for a large range of problems. For example, if we consider …

Unsupervised domain adaptation with dynamics-aware rewards in reinforcement learning

J Liu, H Shen, D Wang, Y Kang… - Advances in Neural …, 2021 - proceedings.neurips.cc
Unsupervised reinforcement learning aims to acquire skills without prior goal
representations, where an agent automatically explores an open-ended environment to …

[PDF][PDF] Fast adaptation to new environments via policy-dynamics value functions

R Raileanu, M Goldstein, A Szlam… - Proceedings of the 37th …, 2020 - scholar.archive.org
Standard RL algorithms assume fixed environment dynamics and require a significant
amount of interaction to adapt to new environments. We introduce Policy-Dynamics Value …

Uncertainty estimation using riemannian model dynamics for offline reinforcement learning

G Tennenholtz, S Mannor - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Abstract Model-based offline reinforcement learning approaches generally rely on bounds of
model error. Estimating these bounds is usually achieved through uncertainty estimation …

Multi-agent policy transfer via task relationship modeling

R Qin, F Chen, T Wang, L Yuan, X Wu, Y Kang… - Science China …, 2024 - Springer
Team adaptation to new cooperative tasks is a hallmark of human intelligence, which has
yet to be fully realized in learning agents. Previous studies on multi-agent transfer learning …

A survey on deep reinforcement learning-based approaches for adaptation and generalization

P Yadav, A Mishra, J Lee, S Kim - arxiv preprint arxiv:2202.08444, 2022 - arxiv.org
Deep Reinforcement Learning (DRL) aims to create intelligent agents that can learn to solve
complex problems efficiently in a real-world environment. Typically, two learning goals …

Fast adaptation via policy-dynamics value functions

R Raileanu, M Goldstein, A Szlam, R Fergus - arxiv preprint arxiv …, 2020 - arxiv.org
Standard RL algorithms assume fixed environment dynamics and require a significant
amount of interaction to adapt to new environments. We introduce Policy-Dynamics Value …

Implicit Ensemble Training for Efficient and Robust Multiagent Reinforcement Learning

M Shen, JP How - Transactions on Machine Learning Research, 2023 - openreview.net
An important issue in competitive multiagent scenarios is the distribution mismatch between
training and testing caused by variations in other agents' policies. As a result, policies …

[HTML][HTML] Learning unsupervised disentangled skill latents to adapt unseen task and morphological modifications

T Kim, P Yadav, H Suk, S Kim - Engineering Applications of Artificial …, 2022 - Elsevier
Learning adaptable policies in the absence of explicit reward signals is a challenging
problem in reinforcement learning. We propose an algorithm that disentangles the …