Tptu: Task planning and tool usage of large language model-based ai agents J Ruan, Y Chen, B Zhang, Z Xu, T Bao, G Du, S Shi, H Mao, X Zeng, ... NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023 | 131* | 2023 |
Modelling the dynamic joint policy of teammates with attention multi-agent DDPG H Mao, Z Zhang, Z Xiao, Z Gong Proceedings of the 18th International Conference on Autonomous Agents and …, 2019 | 112 | 2019 |
Learning agent communication under limited bandwidth by message pruning H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5142-5149, 2020 | 91 | 2020 |
Neighborhood cognition consistent multi-agent reinforcement learning H Mao, W Liu, J Hao, J Luo, D Li, Z Zhang, J Wang, Z Xiao Proceedings of the AAAI conference on artificial intelligence 34 (05), 7219-7226, 2020 | 83 | 2020 |
Accnet: Actor-coordinator-critic net for" learning-to-communicate" with deep multi-agent reinforcement learning H Mao, Z Gong, Y Ni, Z Xiao arXiv preprint arXiv:1706.03235, 2017 | 50 | 2017 |
Learning multi-agent communication with double attentional deep reinforcement learning H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni Autonomous Agents and Multi-Agent Systems 34, 1-34, 2020 | 46 | 2020 |
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Industry Systems Y Kong, J Ruan, Y Chen, B Zhang, T Bao, S Shiwei, D Qing, X Hu, H Mao, ... Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | 33* | 2024 |
An efficient transfer learning framework for multiagent reinforcement learning T Yang, W Wang, H Tang, J Hao, Z Meng, H Mao, D Li, W Liu, Y Chen, ... Advances in Neural Information Processing Systems 34, 17037-17048, 2021 | 32 | 2021 |
Learning multi-agent communication under limited-bandwidth restriction for internet packet routing H Mao, Z Gong, Z Zhang, Z Xiao, Y Ni arXiv preprint arXiv:1903.05561, 2019 | 32 | 2019 |
Benchmarking the text-to-sql capability of large language models: A comprehensive evaluation B Zhang, Y Ye, G Du, X Hu, Z Li, S Yang, CH Liu, R Zhao, Z Li, H Mao arXiv preprint arXiv:2403.02951, 2024 | 30 | 2024 |
Reward design in cooperative multi-agent reinforcement learning for packet routing H Mao, Z Gong, Z Xiao arXiv preprint arXiv:2003.03433, 2020 | 28 | 2020 |
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ... arXiv preprint arXiv:2311.13884, 2023 | 27 | 2023 |
Boosting multiagent reinforcement learning via permutation invariant and permutation equivariant networks HAO Jianye, X Hao, H Mao, W Wang, Y Yang, D Li, Y Zheng, Z Wang The Eleventh International Conference on Learning Representations, 2022 | 26 | 2022 |
Seihai: A sample-efficient hierarchical ai for the minerl competition H Mao, C Wang, X Hao, Y Mao, Y Lu, C Wu, J Hao, D Li, P Tang Distributed Artificial Intelligence: Third International Conference, DAI …, 2022 | 25 | 2022 |
What about inputting policy in value function: Policy representation and policy-extended value function approximator H Tang, Z Meng, J Hao, C Chen, D Graves, D Li, C Yu, H Mao, W Liu, ... Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8441-8449, 2022 | 24 | 2022 |
Structural relational inference actor-critic for multi-agent reinforcement learning X Zhang, Y Liu, X Xu, Q Huang, H Mao, A Carie Neurocomputing 459, 383-394, 2021 | 24 | 2021 |
Cooperative multi-agent transfer learning with level-adaptive credit assignment T Zhou, F Zhang, K Shao, K Li, W Huang, J Luo, W Wang, Y Yang, H Mao, ... arXiv preprint arXiv:2106.00517, 2021 | 21 | 2021 |
PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency Z Li, X Wang, J Zhao, S Yang, G Du, X Hu, B Zhang, Y Ye, Z Li, R Zhao, ... arXiv preprint arXiv:2403.09732, 2024 | 18 | 2024 |
Towards robust and domain agnostic reinforcement learning competitions: MineRL 2020 WH Guss, S Milani, N Topin, B Houghton, S Mohanty, A Melnik, A Harter, ... NeurIPS 2020 Competition and Demonstration Track, 233-252, 2021 | 14 | 2021 |
Transformer in transformer as backbone for deep reinforcement learning H Mao, R Zhao, H Chen, J Hao, Y Chen, D Li, J Zhang, Z Xiao arXiv preprint arXiv:2212.14538, 2022 | 12 | 2022 |