Thinking fast and slow with deep learning and tree search T Anthony, Z Tian, D Barber Advances in Neural Information Processing Systems, 5360-5370, 2017 | 445 | 2017 |
SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving M Zhou, J Luo, J Villella, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ... Conference on Robot Learning, 264-285, 2021 | 225 | 2021 |
Multi-Agent Constrained Policy Optimisation S Gu, JG Kuba, M Wen, R Chen, Z Wang, Z Tian, J Wang, A Knoll, Y Yang arXiv preprint arXiv:2110.02793, 2021 | 57 | 2021 |
A regularized opponent model with maximum entropy objective Z Tian, Y Wen, Z Gong, F Punakkath, S Zou, J Wang arXiv preprint arXiv:1905.08087, 2019 | 48 | 2019 |
Learning to Communicate Implicitly by Actions. Z Tian, S Zou, I Davies, T Warr, L Wu, H Bou-Ammar, J Wang AAAI, 7261-7268, 2020 | 45* | 2020 |
Order Matters: Agent-by-agent Policy Optimization X Wang, Z Tian, Z Wan, Y Wen, J Wang, W Zhang arXiv preprint arXiv:2302.06205, 2023 | 23 | 2023 |
Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer H Lai, W Zhang, X He, C Yu, Z Tian, Y Yu, J Wang 2023 IEEE International Conference on Robotics and Automation (ICRA), 5141-5147, 2023 | 18 | 2023 |
M2N: mesh movement networks for PDE solvers W Song, M Zhang, JG Wallwork, J Gao, Z Tian, F Sun, M Piggott, J Chen, ... Advances in Neural Information Processing Systems 35, 7199-7210, 2022 | 18 | 2022 |
Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework W Zu, W Song, R Chen, Z Guo, F Sun, Z Tian, W Pan, J Wang arXiv preprint arXiv:2311.08244, 2023 | 15 | 2023 |
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem C Yu, W Zhang, H Lai, Z Tian, L Kneip, J Wang 2023 IEEE International Conference on Robotics and Automation (ICRA), 7250-7257, 2023 | 15 | 2023 |
A game-theoretic approach to multi-agent trust region optimization Y Wen, H Chen, Y Yang, M Li, Z Tian, X Chen, J Wang International Conference on Distributed Artificial Intelligence, 74-87, 2022 | 12 | 2022 |
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective Y Wen, Z Wan, M Zhou, S Hou, Z Cao, C Le, J Chen, Z Tian, W Zhang, ... arXiv preprint arXiv:2212.12669, 2022 | 9 | 2022 |
Learning to Model Opponent Learning (Student Abstract) I Davies, Z Tian, J Wang Proceedings of the AAAI Conference on Artificial Intelligence 34 (10), 13771 …, 2020 | 9 | 2020 |
An LLM-driven Framework for Multiple-Vehicle Dispatching and Navigation in Smart City Landscapes R Chen, W Song, W Zu, ZX Dong, Z Guo, F Sun, Z Tian, J Wang 2024 IEEE International Conference on Robotics and Automation (ICRA), 2147-2153, 2024 | 8 | 2024 |
Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future Y Song, H Jiang, H Zhang, Z Tian, W Zhang, J Wang arXiv preprint arXiv:2309.12951, 2023 | 6 | 2023 |
Tri-Modal Motion Retrieval by Learning a Joint Embedding Space K Yin, S Zou, Y Ge, Z Tian arXiv preprint arXiv:2403.00691, 2024 | 5 | 2024 |
An Empirical Study on Google Research Football Multi-agent Scenarios Y Song, H Jiang, Z Tian, H Zhang, Y Zhang, J Zhu, Z Dai, W Zhang, ... arXiv preprint arXiv:2305.09458, 2023 | 3 | 2023 |
ROMO: Retrieval-enhanced Offline Model-based Optimization M Chen, H Zhao, Y Zhao, H Fan, H Gao, Y Yu, Z Tian arXiv preprint arXiv:2310.07560, 2023 | 2 | 2023 |
Learning to Safely Exploit a Non-Stationary Opponent Z Tian, H Ren, Y Yang, Y Sun, Z Han, I Davies, J Wang | 2 | 2021 |
Multi-agent trust region learning Y Wen, H Chen, Y Yang, Z Tian, M Li, X Chen, J Wang | 2 | 2020 |