Controlling large language model-based agents for large-scale decision-making: An actor-critic approach B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ... arXiv preprint arXiv:2311.13884, 2023 | 27 | 2023 |
Haven: Hierarchical cooperative multi-agent reinforcement learning with dual coordination mechanism Z Xu, Y Bai, B Zhang, D Li, G Fan Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11735 …, 2023 | 24 | 2023 |
Mmd-mix: Value function factorisation with maximum mean discrepancy for cooperative multi-agent reinforcement learning Z Xu, D Li, Y Bai, G Fan 2021 International Joint Conference on Neural Networks (IJCNN), 1-7, 2021 | 12 | 2021 |
From explicit communication to tacit cooperation: A novel paradigm for cooperative marl D Li, Z Xu, B Zhang, G Fan arXiv preprint arXiv:2304.14656, 2023 | 11 | 2023 |
Efficient policy generation in multi-agent systems via hypergraph neural network B Zhang, Y Bai, Z Xu, D Li, G Fan International Conference on Neural Information Processing, 219-230, 2022 | 11* | 2022 |
Consensus learning for cooperative multi-agent reinforcement learning Z Xu, B Zhang, D Li, Z Zhang, G Zhou, H Chen, G Fan Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11726 …, 2023 | 10 | 2023 |
Inducing stackelberg equilibrium through spatio-temporal sequential decision-making in multi-agent reinforcement learning B Zhang, L Li, Z Xu, D Li, G Fan arXiv preprint arXiv:2304.10351, 2023 | 10 | 2023 |
Side: State inference for partially observable cooperative multi-agent reinforcement learning Z Xu, Y Bai, D Li, B Zhang, G Fan arXiv preprint arXiv:2105.06228, 2021 | 10 | 2021 |
Learning to coordinate via multiple graph neural networks Z Xu, B Zhang, Y Bai, D Li, G Fan Neural Information Processing: 28th International Conference, ICONIP 2021 …, 2021 | 9 | 2021 |
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning Z Xu, D Li, B Zhang, Y Zhan, Y Baiia, G Fan Advances in Neural Information Processing Systems 35, 11327-11340, 2022 | 7 | 2022 |
Stackelberg decision transformer for asynchronous action coordination in multi-agent systems B Zhang, H Mao, L Li, Z Xu, D Li, R Zhao, G Fan arXiv preprint arXiv:2305.07856, 2023 | 6 | 2023 |
Dual self-awareness value decomposition framework without individual global max for cooperative multi-agent reinforcement learning Z Xu, B Zhang, D Li, G Zhou, Z Zhang, G Fan arXiv preprint arXiv:2302.02180, 2023 | 4 | 2023 |
Style miner: Find significant and stable explanatory factors in time series with constrained reinforcement learning D Li, F Pan, J He, Z Xu, D Tu, G Fan arXiv preprint arXiv:2303.11716, 2023 | 3 | 2023 |
Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning D Li, H Dong, L Wang, B Qiao, S Qin, Q Lin, D Zhang, Q Zhang, Z Xu, ... arXiv preprint arXiv:2404.17780, 2024 | 2 | 2024 |
Dual self-awareness value decomposition framework without individual global max for cooperative MARL Z Xu, B Zhang, G Zhou, Z Zhang, G Fan Advances in Neural Information Processing Systems 36, 73898-73918, 2023 | 2 | 2023 |
Sea: A spatially explicit architecture for multi-agent reinforcement learning D Li, Z Xu, B Zhang, G Fan 2023 International Joint Conference on Neural Networks (IJCNN), 1-8, 2023 | 2 | 2023 |
Multi-agent hyper-attention policy optimization B Zhang, Z Xu, Y Chen, D Li, Y Bai, G Fan, L Li International Conference on Neural Information Processing, 76-87, 2022 | 2 | 2022 |
Adaptive parameter sharing for multi-agent reinforcement learning D Li, N Lou, B Zhang, Z Xu, G Fan ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach B Zhang, H Mao, L Li, Z Xu, D Li, R Zhao, G Fan Forty-first International Conference on Machine Learning, 0 | 1 | |
Constructing Informative Subtask Representations for Multi-Agent Coordination G Zhou, Z Xu, B Zhang, D Li, Z Zhang, G Fan | | |