- Academic Search

H He, C Bai, K Xu, Z Yang, W Zhang… - Advances in neural …, 2023 - proceedings.neurips.cc

Diffusion models have demonstrated highly-expressive generative capabilities in vision and
NLP. Recent studies in reinforcement learning (RL) have shown that diffusion models are …

保存引用被引用次数：72 相关文章所有 5 个版本 HTML 版

[Free GPT-4]

[PDF] acm.org

A Survey of Machine Learning for Urban Decision Making: Applications in Planning, Transportation, and Healthcare

Y Zheng, Q Hao, J Wang, C Gao, J Chen, D **… - ACM Computing …, 2024 - dl.acm.org

Develo** smart cities is vital for ensuring sustainable development and improving human
well-being. One critical aspect of building smart cities is designing intelligent methods to …

保存引用相关文章所有 2 个版本

[Free GPT-4]

[PDF] openreview.net

Reinforcing LLM Agents via Policy Optimization with Action Decomposition

M Wen, Z Wan, J Wang, W Zhang… - The Thirty-eighth Annual …, 2024 - openreview.net

Language models as intelligent agents push the boundaries of sequential decision-making
agents but struggle with limited knowledge of environmental dynamics and exponentially …

保存引用被引用次数：2 相关文章 HTML 版

[Free GPT-4]

[PDF] ijcai.org

[PDF][PDF] Large Decision Models.

W Zhang - IJCAI, 2023 - ijcai.org

Over recent decades, sequential decision-making tasks are mostly tackled with expert
systems and reinforcement learning. However, these methods are still incapable of being …

保存引用被引用次数：7 相关文章所有 2 个版本 HTML 版

[Free GPT-4]

[PDF] arxiv.org

Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning

H Tan, S Liu, K Ma, C Ying, X Zhang, H Su… - arxiv preprint arxiv …, 2024 - arxiv.org

Reinforcement learning is able to obtain generalized low-level robot policies on diverse
robotics datasets in embodied learning scenarios, and Transformer has been widely used to …

保存引用被引用次数：1 相关文章所有 3 个版本 HTML 版

[Free GPT-4]

[PDF] arxiv.org

Trajectory World Models for Heterogeneous Environments

S Yin, J Wu, S Huang, X Su, X He, J Hao… - arxiv preprint arxiv …, 2025 - arxiv.org

Heterogeneity in sensors and actuators across environments poses a significant challenge
to building large-scale pre-trained world models on top of this low-dimensional sensor …

保存引用相关文章 HTML 版

[Free GPT-4]

[PDF] mlr.press

GEAR: a GPU-centric experience replay system for large reinforcement learning models

H Wang, MK Sit, C He, Y Wen… - International …, 2023 - proceedings.mlr.press

This paper introduces a distributed, GPU-centric experience replay system, GEAR, designed
to perform scalable reinforcement learning (RL) with large sequence models (such as …

保存引用被引用次数：1 相关文章所有 9 个版本 HTML 版

[Free GPT-4]

[PDF] arxiv.org

Building Decision Making Models Through Language Model Regime

Y Zhang, H Liu, F Jiang, W Luo, K Zhang - arxiv preprint arxiv:2408.06087, 2024 - arxiv.org

We propose a novel approach for decision making problems leveraging the generalization
capabilities of large language models (LLMs). Traditional methods such as expert systems …

保存引用相关文章 HTML 版

ROMA: Reverse Model-Based Data Augmentation for Offline Reinforcement Learning

X Wei, W Huang, Z Zhai - International Conference on Big Data and …, 2023 - Springer

One of the main challenges of offline Reinforcement Learning is that the difference between
learning policy and behavior policy leads to the possibility that the agent may need to …

保存引用相关文章

创建快讯

引用

高级搜索

已保存到“我的图书馆”

On realization of intelligent decision-making in the real world: A foundation decision model...

Diffusion model is an effective planner and data synthesizer for multi-task reinforcement learning

A Survey of Machine Learning for Urban Decision Making: Applications in Planning, Transportation, and Healthcare

Reinforcing LLM Agents via Policy Optimization with Action Decomposition

[PDF][PDF] Large Decision Models.

Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning

Trajectory World Models for Heterogeneous Environments

GEAR: a GPU-centric experience replay system for large reinforcement learning models

Building Decision Making Models Through Language Model Regime

ROMA: Reverse Model-Based Data Augmentation for Offline Reinforcement Learning