Ikuti
Zhiwei Xu
Judul
Dikutip oleh
Dikutip oleh
Tahun
Tptu: Task planning and tool usage of large language model-based ai agents
J Ruan, Y Chen, B Zhang, Z Xu, T Bao, H Mao, Z Li, X Zeng, R Zhao
NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023
132*2023
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach
B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ...
arXiv preprint arXiv:2311.13884, 2023
272023
Haven: Hierarchical cooperative multi-agent reinforcement learning with dual coordination mechanism
Z Xu, Y Bai, B Zhang, D Li, G Fan
Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11735 …, 2023
262023
Mmd-mix: Value function factorisation with maximum mean discrepancy for cooperative multi-agent reinforcement learning
Z Xu, D Li, Y Bai, G Fan
2021 International Joint Conference on Neural Networks (IJCNN), 1-7, 2021
122021
From explicit communication to tacit cooperation: A novel paradigm for cooperative marl
D Li, Z Xu, B Zhang, G Fan
arXiv preprint arXiv:2304.14656, 2023
112023
Efficient policy generation in multi-agent systems via hypergraph neural network
B Zhang, Y Bai, Z Xu, D Li, G Fan
International Conference on Neural Information Processing, 219-230, 2022
11*2022
Consensus learning for cooperative multi-agent reinforcement learning
Z Xu, B Zhang, D Li, Z Zhang, G Zhou, H Chen, G Fan
Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11726 …, 2023
102023
Inducing stackelberg equilibrium through spatio-temporal sequential decision-making in multi-agent reinforcement learning
B Zhang, L Li, Z Xu, D Li, G Fan
arXiv preprint arXiv:2304.10351, 2023
102023
Side: State inference for partially observable cooperative multi-agent reinforcement learning
Z Xu, Y Bai, D Li, B Zhang, G Fan
arXiv preprint arXiv:2105.06228, 2021
102021
Learning to coordinate via multiple graph neural networks
Z Xu, B Zhang, Y Bai, D Li, G Fan
Neural Information Processing: 28th International Conference, ICONIP 2021 …, 2021
92021
Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach
B Zhang, H Mao, L Li, Z Xu, D Li, R Zhao, G Fan
Forty-first International Conference on Machine Learning, 2024
7*2024
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
Z Xu, D Li, B Zhang, Y Zhan, Y Bai, G Fan
Advances in Neural Information Processing Systems 35, 11327-11340, 2022
72022
Pdit: Interleaving perception and decision-making transformers for deep reinforcement learning
H Mao, R Zhao, Z Li, Z Xu, H Chen, Y Chen, B Zhang, Z Xiao, J Zhang, ...
arXiv preprint arXiv:2312.15863, 2023
62023
Dual self-awareness value decomposition framework without individual global max for cooperative MARL
Z Xu, B Zhang, G Zhou, Z Zhang, G Fan
Advances in Neural Information Processing Systems 36, 73898-73918, 2023
6*2023
Style miner: Find significant and stable explanatory factors in time series with constrained reinforcement learning
D Li, F Pan, J He, Z Xu, D Tu, G Fan
arXiv preprint arXiv:2303.11716, 2023
32023
Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning
Z Xu, H Mao, N Zhang, X Xin, P Ren, D Li, B Zhang, G Fan, Z Chen, ...
arXiv preprint arXiv:2408.09501, 2024
22024
Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning
D Li, H Dong, L Wang, B Qiao, S Qin, Q Lin, D Zhang, Q Zhang, Z Xu, ...
arXiv preprint arXiv:2404.17780, 2024
22024
SORA: Improving Multi-agent Cooperation with a Soft Role Assignment Mechanism
G Zhou, Z Xu, Z Zhang, G Fan
International Conference on Neural Information Processing, 319-331, 2023
22023
Sea: A spatially explicit architecture for multi-agent reinforcement learning
D Li, Z Xu, B Zhang, G Fan
2023 International Joint Conference on Neural Networks (IJCNN), 1-8, 2023
22023
Multi-agent hyper-attention policy optimization
B Zhang, Z Xu, Y Chen, D Li, Y Bai, G Fan, L Li
International Conference on Neural Information Processing, 76-87, 2022
22022
Sistem tidak dapat melakukan operasi ini. Coba lagi nanti.
Artikel 1–20