Diffusion model is an effective planner and data synthesizer for multi-task reinforcement learning

H He, C Bai, K Xu, Z Yang, W Zhang… - Advances in neural …, 2023 - proceedings.neurips.cc
Diffusion models have demonstrated highly-expressive generative capabilities in vision and
NLP. Recent studies in reinforcement learning (RL) have shown that diffusion models are …

A Survey of Machine Learning for Urban Decision Making: Applications in Planning, Transportation, and Healthcare

Y Zheng, Q Hao, J Wang, C Gao, J Chen, D **… - ACM Computing …, 2024 - dl.acm.org
Develo** smart cities is vital for ensuring sustainable development and improving human
well-being. One critical aspect of building smart cities is designing intelligent methods to …

Reinforcing LLM Agents via Policy Optimization with Action Decomposition

M Wen, Z Wan, J Wang, W Zhang… - The Thirty-eighth Annual …, 2024 - openreview.net
Language models as intelligent agents push the boundaries of sequential decision-making
agents but struggle with limited knowledge of environmental dynamics and exponentially …

[PDF][PDF] Large Decision Models.

W Zhang - IJCAI, 2023 - ijcai.org
Over recent decades, sequential decision-making tasks are mostly tackled with expert
systems and reinforcement learning. However, these methods are still incapable of being …

Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning

H Tan, S Liu, K Ma, C Ying, X Zhang, H Su… - arxiv preprint arxiv …, 2024 - arxiv.org
Reinforcement learning is able to obtain generalized low-level robot policies on diverse
robotics datasets in embodied learning scenarios, and Transformer has been widely used to …

Trajectory World Models for Heterogeneous Environments

S Yin, J Wu, S Huang, X Su, X He, J Hao… - arxiv preprint arxiv …, 2025 - arxiv.org
Heterogeneity in sensors and actuators across environments poses a significant challenge
to building large-scale pre-trained world models on top of this low-dimensional sensor …

GEAR: a GPU-centric experience replay system for large reinforcement learning models

H Wang, MK Sit, C He, Y Wen… - International …, 2023 - proceedings.mlr.press
This paper introduces a distributed, GPU-centric experience replay system, GEAR, designed
to perform scalable reinforcement learning (RL) with large sequence models (such as …

Building Decision Making Models Through Language Model Regime

Y Zhang, H Liu, F Jiang, W Luo, K Zhang - arxiv preprint arxiv:2408.06087, 2024 - arxiv.org
We propose a novel approach for decision making problems leveraging the generalization
capabilities of large language models (LLMs). Traditional methods such as expert systems …

ROMA: Reverse Model-Based Data Augmentation for Offline Reinforcement Learning

X Wei, W Huang, Z Zhai - International Conference on Big Data and …, 2023 - Springer
One of the main challenges of offline Reinforcement Learning is that the difference between
learning policy and behavior policy leads to the possibility that the agent may need to …