Hangyu Mao（毛航宇）

Cited by

	All	Since 2020
Citations	1016	997
h-index	18	18
i10-index	23	23

Citations per year
	2018	2019	2020	2021	2022	2023	2024	2025
Citations	5	8	48	72	145	230	470	32

Public access (based on funding mandates)

	Available	Not available
Articles	13	2

Hangyu Mao（毛航宇）

Peking University

Verified email at pku.edu.cn

AI Agent, Multi-Agent Reinforcement Learning, Reinforcement Learning, Large Language Model


Title	Authors	Venue	Cited by	Year
TPTU: Task planning and tool usage of large language model-based AI agents	J Ruan, Y Chen, B Zhang, Z Xu, T Bao, G Du, S Shi, H Mao, X Zeng, ...	NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023	131*	2023
Modelling the dynamic joint policy of teammates with attention multi-agent DDPG	H Mao, Z Zhang, Z Xiao, Z Gong	Proceedings of the 18th International Conference on Autonomous Agents and …, 2019	112	2019
Learning agent communication under limited bandwidth by message pruning	H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni	Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5142-5149, 2020	91	2020
Neighborhood cognition consistent multi-agent reinforcement learning	H Mao, W Liu, J Hao, J Luo, D Li, Z Zhang, J Wang, Z Xiao	Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7219-7226, 2020	83	2020
ACCNet: Actor-coordinator-critic net for "learning-to-communicate" with deep multi-agent reinforcement learning	H Mao, Z Gong, Y Ni, Z Xiao	arXiv preprint arXiv:1706.03235, 2017	50	2017
Learning multi-agent communication with double attentional deep reinforcement learning	H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni	Autonomous Agents and Multi-Agent Systems 34, 1-34, 2020	46	2020
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Industry Systems	Y Kong, J Ruan, Y Chen, B Zhang, T Bao, S Shiwei, D Qing, X Hu, H Mao, ...	Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024	33*	2024
An efficient transfer learning framework for multiagent reinforcement learning	T Yang, W Wang, H Tang, J Hao, Z Meng, H Mao, D Li, W Liu, Y Chen, ...	Advances in Neural Information Processing Systems 34, 17037-17048, 2021	32	2021
Learning multi-agent communication under limited-bandwidth restriction for internet packet routing	H Mao, Z Gong, Z Zhang, Z Xiao, Y Ni	arXiv preprint arXiv:1903.05561, 2019	32	2019
Benchmarking the text-to-SQL capability of large language models: A comprehensive evaluation	B Zhang, Y Ye, G Du, X Hu, Z Li, S Yang, CH Liu, R Zhao, Z Li, H Mao	arXiv preprint arXiv:2403.02951, 2024	30	2024
Reward design in cooperative multi-agent reinforcement learning for packet routing	H Mao, Z Gong, Z Xiao	arXiv preprint arXiv:2003.03433, 2020	28	2020
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach	B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ...	arXiv preprint arXiv:2311.13884, 2023	27	2023
Boosting multiagent reinforcement learning via permutation invariant and permutation equivariant networks	J Hao, X Hao, H Mao, W Wang, Y Yang, D Li, Y Zheng, Z Wang	The Eleventh International Conference on Learning Representations, 2022	26	2022
SEIHAI: A sample-efficient hierarchical AI for the MineRL competition	H Mao, C Wang, X Hao, Y Mao, Y Lu, C Wu, J Hao, D Li, P Tang	Distributed Artificial Intelligence: Third International Conference, DAI …, 2022	25	2022
What about inputting policy in value function: Policy representation and policy-extended value function approximator	H Tang, Z Meng, J Hao, C Chen, D Graves, D Li, C Yu, H Mao, W Liu, ...	Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8441-8449, 2022	24	2022
Structural relational inference actor-critic for multi-agent reinforcement learning	X Zhang, Y Liu, X Xu, Q Huang, H Mao, A Carie	Neurocomputing 459, 383-394, 2021	24	2021
Cooperative multi-agent transfer learning with level-adaptive credit assignment	T Zhou, F Zhang, K Shao, K Li, W Huang, J Luo, W Wang, Y Yang, H Mao, ...	arXiv preprint arXiv:2106.00517, 2021	21	2021
PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency	Z Li, X Wang, J Zhao, S Yang, G Du, X Hu, B Zhang, Y Ye, Z Li, R Zhao, ...	arXiv preprint arXiv:2403.09732, 2024	18	2024
Towards robust and domain agnostic reinforcement learning competitions: MineRL 2020	WH Guss, S Milani, N Topin, B Houghton, S Mohanty, A Melnik, A Harter, ...	NeurIPS 2020 Competition and Demonstration Track, 233-252, 2021	14	2021
Transformer in transformer as backbone for deep reinforcement learning	H Mao, R Zhao, H Chen, J Hao, Y Chen, D Li, J Zhang, Z Xiao	arXiv preprint arXiv:2212.14538, 2022	12	2022
