On Transforming Reinforcement Learning With Transformers: The Development Trajectory
Transformers, originally devised for natural language processing (NLP), have also produced
significant successes in computer vision (CV). Due to their strong expression power …
significant successes in computer vision (CV). Due to their strong expression power …
Critic-guided decision transformer for offline reinforcement learning
Recent advancements in offline reinforcement learning (RL) have underscored the
capabilities of Return-Conditioned Supervised Learning (RCSL), a paradigm that learns the …
capabilities of Return-Conditioned Supervised Learning (RCSL), a paradigm that learns the …
Deep generative models for offline policy learning: Tutorial, survey, and perspectives on future directions
Deep generative models (DGMs) have demonstrated great success across various domains,
particularly in generating texts, images, and videos using models trained from offline data …
particularly in generating texts, images, and videos using models trained from offline data …
Crossway diffusion: Improving diffusion-based visuomotor policy via self-supervised learning
Diffusion models have been adopted for behavioral cloning in a sequence modeling
fashion, benefiting from their exceptional capabilities in modeling complex data distributions …
fashion, benefiting from their exceptional capabilities in modeling complex data distributions …
Learning multi-agent communication from graph modeling perspective
In numerous artificial intelligence applications, the collaborative efforts of multiple intelligent
agents are imperative for the successful attainment of target objectives. To enhance …
agents are imperative for the successful attainment of target objectives. To enhance …
Prompt-tuning decision transformer with preference ranking
Prompt-tuning has emerged as a promising method for adapting pre-trained models to
downstream tasks or aligning with human preferences. Prompt learning is widely used in …
downstream tasks or aligning with human preferences. Prompt learning is widely used in …
Pdit: Interleaving perception and decision-making transformers for deep reinforcement learning
Designing better deep networks and better reinforcement learning (RL) algorithms are both
important for deep RL. This work studies the former. Specifically, the Perception and …
important for deep RL. This work studies the former. Specifically, the Perception and …
Q-value regularized transformer for offline reinforcement learning
Recent advancements in offline reinforcement learning (RL) have underscored the
capabilities of Conditional Sequence Modeling (CSM), a paradigm that learns the action …
capabilities of Conditional Sequence Modeling (CSM), a paradigm that learns the action …
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
The purpose of offline multi-task reinforcement learning (MTRL) is to develop a unified policy
applicable to diverse tasks without the need for online environmental interaction. Recent …
applicable to diverse tasks without the need for online environmental interaction. Recent …
Gradformer: Graph Transformer with Exponential Decay
Graph Transformers (GTs) have demonstrated their advantages across a wide range of
tasks. However, the self-attention mechanism in GTs overlooks the graph's inductive biases …
tasks. However, the self-attention mechanism in GTs overlooks the graph's inductive biases …