A survey of progress on cooperative multi-agent reinforcement learning in open environment
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …
has made progress in various fields. Specifically, cooperative MARL focuses on training a …
Graph decision transformer
Offline reinforcement learning (RL) is a challenging task, whose objective is to learn policies
from static trajectory data without interacting with the environment. Recently, offline RL has …
from static trajectory data without interacting with the environment. Recently, offline RL has …
Decomposed Prompt Decision Transformer for Efficient Unseen Task Generalization
Multi-task offline reinforcement learning aims to develop a unified policy for diverse tasks
without requiring real-time interaction with the environment. Recent work explores sequence …
without requiring real-time interaction with the environment. Recent work explores sequence …
Prompt-tuning decision transformer with preference ranking
Prompt-tuning has emerged as a promising method for adapting pre-trained models to
downstream tasks or aligning with human preferences. Prompt learning is widely used in …
downstream tasks or aligning with human preferences. Prompt learning is widely used in …
Saformer: A conditional sequence modeling approach to offline safe reinforcement learning
Offline safe RL is of great practical relevance for deploying agents in real-world applications.
However, acquiring constraint-satisfying policies from the fixed dataset is non-trivial for …
However, acquiring constraint-satisfying policies from the fixed dataset is non-trivial for …
Pdit: Interleaving perception and decision-making transformers for deep reinforcement learning
Designing better deep networks and better reinforcement learning (RL) algorithms are both
important for deep RL. This work studies the former. Specifically, the Perception and …
important for deep RL. This work studies the former. Specifically, the Perception and …
Instructed diffuser with temporal condition guidance for offline reinforcement learning
Recent works have shown the potential of diffusion models in computer vision and natural
language processing. Apart from the classical supervised learning fields, diffusion models …
language processing. Apart from the classical supervised learning fields, diffusion models …
Transformer in transformer as backbone for deep reinforcement learning
Designing better deep networks and better reinforcement learning (RL) algorithms are both
important for deep RL. This work focuses on the former. Previous methods build the network …
important for deep RL. This work focuses on the former. Previous methods build the network …
Q-value regularized transformer for offline reinforcement learning
Recent advancements in offline reinforcement learning (RL) have underscored the
capabilities of Conditional Sequence Modeling (CSM), a paradigm that learns the action …
capabilities of Conditional Sequence Modeling (CSM), a paradigm that learns the action …
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
The purpose of offline multi-task reinforcement learning (MTRL) is to develop a unified policy
applicable to diverse tasks without the need for online environmental interaction. Recent …
applicable to diverse tasks without the need for online environmental interaction. Recent …