Zhiwei Xu

Cited by

	All	Since 2020
Citations	302	301
h-index	9	9
i10-index	9	9

Citations per year: 2022: 11; 2023: 70; 2024: 201; 2025: 19

Public access (based on funding mandates): 2 articles available, 0 articles not available

Co-authors

Bin Zhang; Institute of Automation, Chinese Academy of Sciences; verified email at ia.ac.cn
Dapeng Li; Institute of Automation, Chinese Academy of Sciences; verified email at ia.ac.cn
Hangyu Mao（毛航宇）; Peking University; verified email at pku.edu.cn
Jingqing Ruan; Meituan; verified email at meituan.com
Guangchong Zhou; Institute of Automation, Chinese Academy of Sciences; verified email at sjtu.edu.cn
Zeren Zhang; Institute of Automation, Chinese Academy of Sciences; verified email at ia.ac.cn
Hao Chen; University College London | University of Chinese Academy of Sciences; verified email at ucl.ac.uk
Yiqun Chen; Renmin University of China; verified email at ruc.edu.cn


Zhiwei Xu

Other names: 徐志伟

Shandong University

Verified email at sdu.edu.cn

Reinforcement Learning, Multi-Agent System, LLM-based Agent


Title	Cited by	Year
Tptu: Task planning and tool usage of large language model-based ai agents; J Ruan, Y Chen, B Zhang, Z Xu, T Bao, H Mao, Z Li, X Zeng, R Zhao; NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023	132*	2023
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach; B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ...; arXiv preprint arXiv:2311.13884, 2023	27	2023
Haven: Hierarchical cooperative multi-agent reinforcement learning with dual coordination mechanism; Z Xu, Y Bai, B Zhang, D Li, G Fan; Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11735 …, 2023	26	2023
Mmd-mix: Value function factorisation with maximum mean discrepancy for cooperative multi-agent reinforcement learning; Z Xu, D Li, Y Bai, G Fan; 2021 International Joint Conference on Neural Networks (IJCNN), 1-7, 2021	12	2021
From explicit communication to tacit cooperation: A novel paradigm for cooperative marl; D Li, Z Xu, B Zhang, G Fan; arXiv preprint arXiv:2304.14656, 2023	11	2023
Efficient policy generation in multi-agent systems via hypergraph neural network; B Zhang, Y Bai, Z Xu, D Li, G Fan; International Conference on Neural Information Processing, 219-230, 2022	11*	2022
Consensus learning for cooperative multi-agent reinforcement learning; Z Xu, B Zhang, D Li, Z Zhang, G Zhou, H Chen, G Fan; Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11726 …, 2023	10	2023
Inducing stackelberg equilibrium through spatio-temporal sequential decision-making in multi-agent reinforcement learning; B Zhang, L Li, Z Xu, D Li, G Fan; arXiv preprint arXiv:2304.10351, 2023	10	2023
Side: State inference for partially observable cooperative multi-agent reinforcement learning; Z Xu, Y Bai, D Li, B Zhang, G Fan; arXiv preprint arXiv:2105.06228, 2021	10	2021
Learning to coordinate via multiple graph neural networks; Z Xu, B Zhang, Y Bai, D Li, G Fan; Neural Information Processing: 28th International Conference, ICONIP 2021 …, 2021	9	2021
Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach; B Zhang, H Mao, L Li, Z Xu, D Li, R Zhao, G Fan; Forty-first International Conference on Machine Learning, 2024	7*	2024
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning; Z Xu, D Li, B Zhang, Y Zhan, Y Bai, G Fan; Advances in Neural Information Processing Systems 35, 11327-11340, 2022	7	2022
Pdit: Interleaving perception and decision-making transformers for deep reinforcement learning; H Mao, R Zhao, Z Li, Z Xu, H Chen, Y Chen, B Zhang, Z Xiao, J Zhang, ...; arXiv preprint arXiv:2312.15863, 2023	6	2023
Dual self-awareness value decomposition framework without individual global max for cooperative MARL; Z Xu, B Zhang, G Zhou, Z Zhang, G Fan; Advances in Neural Information Processing Systems 36, 73898-73918, 2023	6*	2023
Style miner: Find significant and stable explanatory factors in time series with constrained reinforcement learning; D Li, F Pan, J He, Z Xu, D Tu, G Fan; arXiv preprint arXiv:2303.11716, 2023	3	2023
Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning; Z Xu, H Mao, N Zhang, X Xin, P Ren, D Li, B Zhang, G Fan, Z Chen, ...; arXiv preprint arXiv:2408.09501, 2024	2	2024
Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning; D Li, H Dong, L Wang, B Qiao, S Qin, Q Lin, D Zhang, Q Zhang, Z Xu, ...; arXiv preprint arXiv:2404.17780, 2024	2	2024
SORA: Improving Multi-agent Cooperation with a Soft Role Assignment Mechanism; G Zhou, Z Xu, Z Zhang, G Fan; International Conference on Neural Information Processing, 319-331, 2023	2	2023
Sea: A spatially explicit architecture for multi-agent reinforcement learning; D Li, Z Xu, B Zhang, G Fan; 2023 International Joint Conference on Neural Networks (IJCNN), 1-8, 2023	2	2023
Multi-agent hyper-attention policy optimization; B Zhang, Z Xu, Y Chen, D Li, Y Bai, G Fan, L Li; International Conference on Neural Information Processing, 76-87, 2022	2	2022
