Improving knowledge tracing via pre-training question embeddings Y Liu, Y Yang, X Chen, J Shen, H Zhang, Y Yu
arXiv preprint arXiv:2012.05031, 2020
146 2020 Learn to navigate: cooperative path planning for unmanned surface vehicles using deep reinforcement learning X Zhou, P Wu, H Zhang, W Guo, Y Liu
Ieee Access 7, 165262-165278, 2019
146 2019 Offline pre-trained multi-agent decision transformer L Meng, M Wen, C Le, X Li, D Xing, W Zhang, Y Wen, H Zhang, J Wang, ...
Machine Intelligence Research 20 (2), 233-248, 2023
110 2023 Bi-level actor-critic for multi-agent coordination H Zhang, W Chen, Z Huang, M Li, Y Yang, W Zhang, J Wang
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7325-7332, 2020
104 2020 Learning correlated communication topology in multi-agent reinforcement learning Y Du, B Liu, V Moens, Z Liu, Z Ren, J Wang, X Chen, H Zhang
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
76 2021 Settling the variance of multi-agent policy gradients JG Kuba, M Wen, L Meng, H Zhang, D Mguni, J Wang, Y Yang
Advances in Neural Information Processing Systems 34, 13458-13470, 2021
71 2021 User response learning for directly optimizing campaign performance in display advertising K Ren, W Zhang, Y Rong, H Zhang, Y Yu, J Wang
Proceedings of the 25th acm international on conference on information and …, 2016
54 2016 GCS: Graph-based coordination strategy for multi-agent reinforcement learning J Ruan, Y Du, X Xiong, D Xing, X Li, L Meng, H Zhang, J Wang, B Xu
arXiv preprint arXiv:2201.06257, 2022
48 2022 Token-level direct preference optimization Y Zeng, G Liu, W Ma, N Yang, H Zhang, J Wang
arXiv preprint arXiv:2404.11999, 2024
46 2024 Large language models play starcraft ii: Benchmarks and a chain of summarization approach W Ma, Q Mi, Y Zeng, X Yan, R Lin, Y Wu, J Wang, H Zhang
Advances in Neural Information Processing Systems 37, 133386-133442, 2025
42 2025 Large sequence models for sequential decision-making: a survey M Wen, R Lin, H Wang, Y Yang, Y Wen, L Mai, J Wang, H Zhang, ...
Frontiers of Computer Science 17 (6), 176349, 2023
36 2023 A review: machine learning for combinatorial optimization problems in energy areas X Yang, Z Wang, H Zhang, N Ma, N Yang, H Liu, H Zhang, L Yang
Algorithms 15 (6), 205, 2022
29 2022 Botzone: an online multi-agent competitive platform for ai education H Zhou, H Zhang, Y Zhou, X Wang, W Li
Proceedings of the 23rd Annual ACM Conference on Innovation and Technology …, 2018
28 2018 Layout design for intelligent warehouse by evolution with fitness approximation H Zhang, Z Guo, W Zhang, H Cai, C Wang, Y Yu, W Li, J Wang
IEEE Access 7, 166310-166317, 2019
24 2019 Learning to design games: Strategic environments in reinforcement learning H Zhang, J Wang, Z Zhou, W Zhang, Y Wen, Y Yu, W Li
Proceedings of the 27th international joint conference on Artificial …, 2017
18 2017 A game-theoretic approach for improving generalization ability of TSP solvers C Wang, Y Yang, O Slumbers, C Han, T Guo, H Zhang, J Wang
arXiv preprint arXiv:2110.15105, 2021
17 2021 Managing risk of bidding in display advertising H Zhang, W Zhang, Y Rong, K Ren, W Li, J Wang
Proceedings of the Tenth ACM International Conference on Web Search and Data …, 2017
15 2017 Estimating -Rank from A Few Entries with Low Rank Matrix Completion Y Du, X Yan, X Chen, J Wang, H Zhang
International Conference on Machine Learning, 2870-2879, 2021
13 2021 Botzone: A competitive and interactive platform for game AI education H Zhou, Y Zhou, H Zhang, H Huang, W Li
Proceedings of the ACM turing 50th celebration conference-China, 1-5, 2017
11 2017 A theoretical understanding of gradient bias in meta-reinforcement learning B Liu, X Feng, J Ren, L Mai, R Zhu, H Zhang, J Wang, Y Yang
Advances in Neural Information Processing Systems 35, 31059-31072, 2022
10 2022