- Academic Search

Z Zhu, K Lin, AK Jain, J Zhou - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org

Reinforcement learning is a learning paradigm for solving sequential decision-making
problems. Recent years have witnessed remarkable progress in reinforcement learning …

Save Cite Cited by 749 Related articles All 12 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

A tutorial on sparse Gaussian processes and variational inference

F Leibfried, V Dutordoir, ST John… - arxiv preprint arxiv …, 2020 - arxiv.org

Gaussian processes (GPs) provide a framework for Bayesian inference that can offer
principled uncertainty estimates for a large range of problems. For example, if we consider …

Save Cite Cited by 57 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Unsupervised domain adaptation with dynamics-aware rewards in reinforcement learning

J Liu, H Shen, D Wang, Y Kang… - Advances in Neural …, 2021 - proceedings.neurips.cc

Unsupervised reinforcement learning aims to acquire skills without prior goal
representations, where an agent automatically explores an open-ended environment to …

Save Cite Cited by 22 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] archive.org

[PDF][PDF] Fast adaptation to new environments via policy-dynamics value functions

R Raileanu, M Goldstein, A Szlam… - Proceedings of the 37th …, 2020 - scholar.archive.org

Standard RL algorithms assume fixed environment dynamics and require a significant
amount of interaction to adapt to new environments. We introduce Policy-Dynamics Value …

Save Cite Cited by 29 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Uncertainty estimation using riemannian model dynamics for offline reinforcement learning

G Tennenholtz, S Mannor - Advances in Neural Information …, 2022 - proceedings.neurips.cc

Abstract Model-based offline reinforcement learning approaches generally rely on bounds of
model error. Estimating these bounds is usually achieved through uncertainty estimation …

Save Cite Cited by 13 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Multi-agent policy transfer via task relationship modeling

R Qin, F Chen, T Wang, L Yuan, X Wu, Y Kang… - Science China …, 2024 - Springer

Team adaptation to new cooperative tasks is a hallmark of human intelligence, which has
yet to be fully realized in learning agents. Previous studies on multi-agent transfer learning …

Save Cite Cited by 8 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

A survey on deep reinforcement learning-based approaches for adaptation and generalization

P Yadav, A Mishra, J Lee, S Kim - arxiv preprint arxiv:2202.08444, 2022 - arxiv.org

Deep Reinforcement Learning (DRL) aims to create intelligent agents that can learn to solve
complex problems efficiently in a real-world environment. Typically, two learning goals …

Save Cite Cited by 6 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Fast adaptation via policy-dynamics value functions

R Raileanu, M Goldstein, A Szlam, R Fergus - arxiv preprint arxiv …, 2020 - arxiv.org

Standard RL algorithms assume fixed environment dynamics and require a significant
amount of interaction to adapt to new environments. We introduce Policy-Dynamics Value …

Save Cite Cited by 12 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] openreview.net

Implicit Ensemble Training for Efficient and Robust Multiagent Reinforcement Learning

M Shen, JP How - Transactions on Machine Learning Research, 2023 - openreview.net

An important issue in competitive multiagent scenarios is the distribution mismatch between
training and testing caused by variations in other agents' policies. As a result, policies …

Save Cite Cited by 3 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] Learning unsupervised disentangled skill latents to adapt unseen task and morphological modifications

T Kim, P Yadav, H Suk, S Kim - Engineering Applications of Artificial …, 2022 - Elsevier

Learning adaptable policies in the absence of explicit reward signals is a challenging
problem in reinforcement learning. We propose an algorithm that disentangles the …

Save Cite Cited by 1 Related articles All 3 versions Free GPT-4

Create alert

Cite

Advanced search

Saved to My library

Disentangled skill embeddings for reinforcement learning

Transfer learning in deep reinforcement learning: A survey

A tutorial on sparse Gaussian processes and variational inference

Unsupervised domain adaptation with dynamics-aware rewards in reinforcement learning

[PDF][PDF] Fast adaptation to new environments via policy-dynamics value functions

Uncertainty estimation using riemannian model dynamics for offline reinforcement learning

Multi-agent policy transfer via task relationship modeling

A survey on deep reinforcement learning-based approaches for adaptation and generalization

Fast adaptation via policy-dynamics value functions

Implicit Ensemble Training for Efficient and Robust Multiagent Reinforcement Learning

[HTML][HTML] Learning unsupervised disentangled skill latents to adapt unseen task and morphological modifications