Tptu: Task planning and tool usage of large language model-based ai agents J Ruan, Y Chen, B Zhang, Z Xu, T Bao, H Mao, Z Li, X Zeng, R Zhao NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023 | 115 | 2023 |
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Industry Systems Y Kong, J Ruan, Y Chen, B Zhang, T Bao, S Shiwei, D Qing, X Hu, H Mao, ... Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | 33* | 2024 |
Benchmarking the text-to-sql capability of large language models: A comprehensive evaluation B Zhang, Y Ye, G Du, X Hu, Z Li, S Yang, CH Liu, R Zhao, Z Li, H Mao arXiv preprint arXiv:2403.02951, 2024 | 31 | 2024 |
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ... LLM Agents Workshop@ICLR2024, 2023 | 27 | 2023 |
HAVEN: hierarchical cooperative multi-agent reinforcement learning with dual coordination mechanism Z Xu, Y Bai, B Zhang, D Li, G Fan Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11735 …, 2023 | 26 | 2023 |
PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency Z Li, X Wang, J Zhao, S Yang, G Du, X Hu, B Zhang, Y Ye, Z Li, R Zhao, ... arXiv preprint arXiv:2403.09732, 2024 | 18 | 2024 |
TPTU: large language model-based AI agents for task planning and tool usage J Ruan, Y Chen, B Zhang, Z Xu, T Bao, G Du, S Shi, H Mao, Z Li, X Zeng, ... arXiv preprint arXiv:2308.03427, 2023 | 18 | 2023 |
Ptde: Personalized training with distillated execution for multi-agent reinforcement learning Y Chen, H Mao, T Zhang, S Wu, B Zhang, J Hao, D Li, B Wang, H Chang arXiv preprint arXiv:2210.08872, 2022 | 12 | 2022 |
Cooperative multi-agent reinforcement learning with hypergraph convolution Y Bai, C Gong, B Zhang, G Fan, X Hou, Y Lu 2022 International Joint Conference on Neural Networks (IJCNN), 1-8, 2022 | 12* | 2022 |
From explicit communication to tacit cooperation: A novel paradigm for cooperative marl D Li, Z Xu, B Zhang, G Fan arXiv preprint arXiv:2304.14656, 2023 | 11 | 2023 |
Efficient Policy Generation in Multi-agent Systems via Hypergraph Neural Network B Zhang, Y Bai, Z Xu, D Li, G Fan International Conference on Neural Information Processing (ICONIP), 219-230, 2022 | 11* | 2022 |
Consensus learning for cooperative multi-agent reinforcement learning Z Xu, B Zhang, D Li, Z Zhang, G Zhou, H Chen, G Fan Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11726 …, 2023 | 10 | 2023 |
Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning B Zhang, L Li, Z Xu, D Li, G Fan IJCAI 2023, 2023 | 10 | 2023 |
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning Z Xu, Y Bai, D Li, B Zhang, G Fan The 19nd International Conference on Autonomous Agents and Multiagent …, 2021 | 10 | 2021 |
Learning to coordinate via multiple graph neural networks Z Xu, B Zhang, Y Bai, D Li, G Fan Neural Information Processing: 28th International Conference, ICONIP 2021 …, 2021 | 9 | 2021 |
Mingling foresight with imagination: Model-based cooperative multi-agent reinforcement learning Z Xu, B Zhang, Y Zhan, Y Baiia, G Fan Advances in Neural Information Processing Systems 35, 11327-11340, 2022 | 7 | 2022 |
Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach B Zhang, H Mao, L Li, Z Xu, D Li, R Zhao, G Fan Forty-first International Conference on Machine Learning, 0 | 7* | |
Dual self-awareness value decomposition framework without individual global max for cooperative MARL Z Xu, B Zhang, G Zhou, Z Zhang, G Fan Advances in Neural Information Processing Systems 36, 2024 | 6* | 2024 |
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning H Mao, R Zhao, Z Li, Z Xu, H Chen, Y Chen, B Zhang, Z Xiao, J Zhang, ... arXiv preprint arXiv:2312.15863, 2023 | 6 | 2023 |
Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning Z Xu, H Mao, N Zhang, X Xin, P Ren, D Li, B Zhang, G Fan, Z Chen, ... arXiv preprint arXiv:2408.09501, 2024 | 2 | 2024 |