- Academic Search

F Ni, J Hao, Y Mu, Y Yuan, Y Zheng… - International …, 2023 - proceedings.mlr.press

Recently, diffusion model shines as a promising backbone for the sequence modeling
paradigm in offline reinforcement learning (RL). However, these works mostly lack the …

Save Cite Cited by 43 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Ode-based recurrent model-free reinforcement learning for pomdps

X Zhao, D Zhang, H Liyuan… - Advances in Neural …, 2023 - proceedings.neurips.cc

Neural ordinary differential equations (ODEs) are widely recognized as the standard for
modeling physical mechanisms, which help to perform approximate inference in unknown …

Save Cite Cited by 8 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning

Z Gao, Y Mu, J Qu, M Hu, L Guo, P Luo, Y Lu - arxiv preprint arxiv …, 2024 - arxiv.org

Dual-arm robots offer enhanced versatility and efficiency over single-arm counterparts by
enabling concurrent manipulation of multiple objects or cooperative execution of tasks using …

Save Cite Cited by 8 Related articles View as HTML

[Free GPT-4]

[PDF] aaai.org

MetaCARD: Meta-Reinforcement Learning with Task Uncertainty Feedback via Decoupled Context-Aware Reward and Dynamics Components

M Wang, X Li, L Zhang, M Wang - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

Meta-Reinforcement Learning (Meta-RL) aims to reveal shared characteristics in dynamics
and reward functions across diverse training tasks. This objective is achieved by meta …

Save Cite Cited by 1 Related articles View as HTML

[Free GPT-4]

[PDF] arxiv.org

Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning

X Yu, M Dunion, X Li, SV Albrecht - arxiv preprint arxiv:2406.04815, 2024 - arxiv.org

Meta-Reinforcement Learning (Meta-RL) agents can struggle to operate across tasks with
varying environmental features that require different optimal skills (ie, different modes of …

[Free GPT-4]

[PDF] arxiv.org

CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation

Z Zhang, W Meng, H Sun, G Pan - arxiv preprint arxiv:2502.00983, 2025 - arxiv.org

Context-based offline meta-reinforcement learning (OMRL) methods have achieved
appealing success by leveraging pre-collected offline datasets to develop task …

Save Cite Related articles View as HTML

[Free GPT-4]

[PDF] openreview.net

Skill-aware Mutual Information Optimisation for Zero-shot Generalisation in Reinforcement Learning

X Yu, M Dunion, X Li, SV Albrecht - The Thirty-eighth Annual Conference … - openreview.net

Meta-Reinforcement Learning (Meta-RL) agents can struggle to operate across tasks with
varying environmental features that require different optimal skills (ie, different modes of …

Save Cite Related articles View as HTML

IMLRLS: a method for ship collision avoidance by integrating meta-learning with reinforcement learning

X Jia, S Gao, W He - Third International Conference on …, 2024 - spiedigitallibrary.org

Autonomous collision avoidance is vital for intelligent ship navigation. To improve the
adaptability and effectiveness of collision avoidance policies, we propose a method that …

Dynamics Generalisation in Reinforcement Learning Through the Use of Adaptive Policies

M Beukman - 2023 - search.proquest.com

Reinforcement learning (RL) is a widely-used method for training agents to interact with an
external environment, and is commonly used in fields such as robotics. While RL has …

Save Cite Related articles

[CITATION][C] A Survey of Meta-Reinforcement Learning Research

陈奕宇，霍静，丁天雨，高阳 - Journal of Software, 2023

Create alert

Cite

Advanced search

Saved to My library

Domino: Decomposed mutual information optimization for generalized context in meta-reinforcement...

Metadiffuser: Diffusion model as conditional planner for offline meta-rl

Ode-based recurrent model-free reinforcement learning for pomdps

DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning

MetaCARD: Meta-Reinforcement Learning with Task Uncertainty Feedback via Decoupled Context-Aware Reward and Dynamics Components

Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning

CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation

Skill-aware Mutual Information Optimisation for Zero-shot Generalisation in Reinforcement Learning

IMLRLS: a method for ship collision avoidance by integrating meta-learning with reinforcement learning

Dynamics Generalisation in Reinforcement Learning Through the Use of Adaptive Policies

[CITATION][C] A Survey of Meta-Reinforcement Learning Research