Metadiffuser: Diffusion model as conditional planner for offline meta-rl

F Ni, J Hao, Y Mu, Y Yuan, Y Zheng… - International …, 2023 - proceedings.mlr.press
Recently, diffusion model shines as a promising backbone for the sequence modeling
paradigm in offline reinforcement learning (RL). However, these works mostly lack the …

Ode-based recurrent model-free reinforcement learning for pomdps

X Zhao, D Zhang, H Liyuan… - Advances in Neural …, 2023 - proceedings.neurips.cc
Neural ordinary differential equations (ODEs) are widely recognized as the standard for
modeling physical mechanisms, which help to perform approximate inference in unknown …

DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning

Z Gao, Y Mu, J Qu, M Hu, L Guo, P Luo, Y Lu - arxiv preprint arxiv …, 2024 - arxiv.org
Dual-arm robots offer enhanced versatility and efficiency over single-arm counterparts by
enabling concurrent manipulation of multiple objects or cooperative execution of tasks using …

MetaCARD: Meta-Reinforcement Learning with Task Uncertainty Feedback via Decoupled Context-Aware Reward and Dynamics Components

M Wang, X Li, L Zhang, M Wang - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Meta-Reinforcement Learning (Meta-RL) aims to reveal shared characteristics in dynamics
and reward functions across diverse training tasks. This objective is achieved by meta …

Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning

X Yu, M Dunion, X Li, SV Albrecht - arxiv preprint arxiv:2406.04815, 2024 - arxiv.org
Meta-Reinforcement Learning (Meta-RL) agents can struggle to operate across tasks with
varying environmental features that require different optimal skills (ie, different modes of …

CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation

Z Zhang, W Meng, H Sun, G Pan - arxiv preprint arxiv:2502.00983, 2025 - arxiv.org
Context-based offline meta-reinforcement learning (OMRL) methods have achieved
appealing success by leveraging pre-collected offline datasets to develop task …

Skill-aware Mutual Information Optimisation for Zero-shot Generalisation in Reinforcement Learning

X Yu, M Dunion, X Li, SV Albrecht - The Thirty-eighth Annual Conference … - openreview.net
Meta-Reinforcement Learning (Meta-RL) agents can struggle to operate across tasks with
varying environmental features that require different optimal skills (ie, different modes of …

IMLRLS: a method for ship collision avoidance by integrating meta-learning with reinforcement learning

X Jia, S Gao, W He - Third International Conference on …, 2024 - spiedigitallibrary.org
Autonomous collision avoidance is vital for intelligent ship navigation. To improve the
adaptability and effectiveness of collision avoidance policies, we propose a method that …

Dynamics Generalisation in Reinforcement Learning Through the Use of Adaptive Policies

M Beukman - 2023 - search.proquest.com
Reinforcement learning (RL) is a widely-used method for training agents to interact with an
external environment, and is commonly used in fields such as robotics. While RL has …

[CITATION][C] A Survey of Meta-Reinforcement Learning Research

陈奕宇, 霍静, 丁天雨, 高阳 - Journal of Software, 2023