Diffusion model is an effective planner and data synthesizer for multi-task reinforcement learning
Diffusion models have demonstrated highly-expressive generative capabilities in vision and
NLP. Recent studies in reinforcement learning (RL) have shown that diffusion models are …
NLP. Recent studies in reinforcement learning (RL) have shown that diffusion models are …
A Survey of Machine Learning for Urban Decision Making: Applications in Planning, Transportation, and Healthcare
Develo** smart cities is vital for ensuring sustainable development and improving human
well-being. One critical aspect of building smart cities is designing intelligent methods to …
well-being. One critical aspect of building smart cities is designing intelligent methods to …
Reinforcing LLM Agents via Policy Optimization with Action Decomposition
Language models as intelligent agents push the boundaries of sequential decision-making
agents but struggle with limited knowledge of environmental dynamics and exponentially …
agents but struggle with limited knowledge of environmental dynamics and exponentially …
Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning
Reinforcement learning is able to obtain generalized low-level robot policies on diverse
robotics datasets in embodied learning scenarios, and Transformer has been widely used to …
robotics datasets in embodied learning scenarios, and Transformer has been widely used to …
Trajectory World Models for Heterogeneous Environments
Heterogeneity in sensors and actuators across environments poses a significant challenge
to building large-scale pre-trained world models on top of this low-dimensional sensor …
to building large-scale pre-trained world models on top of this low-dimensional sensor …
GEAR: a GPU-centric experience replay system for large reinforcement learning models
This paper introduces a distributed, GPU-centric experience replay system, GEAR, designed
to perform scalable reinforcement learning (RL) with large sequence models (such as …
to perform scalable reinforcement learning (RL) with large sequence models (such as …
Building Decision Making Models Through Language Model Regime
We propose a novel approach for decision making problems leveraging the generalization
capabilities of large language models (LLMs). Traditional methods such as expert systems …
capabilities of large language models (LLMs). Traditional methods such as expert systems …
ROMA: Reverse Model-Based Data Augmentation for Offline Reinforcement Learning
X Wei, W Huang, Z Zhai - International Conference on Big Data and …, 2023 - Springer
One of the main challenges of offline Reinforcement Learning is that the difference between
learning policy and behavior policy leads to the possibility that the agent may need to …
learning policy and behavior policy leads to the possibility that the agent may need to …