Seuraa
Haifeng Zhang
Haifeng Zhang
Vahvistettu sähköpostiosoite verkkotunnuksessa ia.ac.cn - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Improving knowledge tracing via pre-training question embeddings
Y Liu, Y Yang, X Chen, J Shen, H Zhang, Y Yu
arXiv preprint arXiv:2012.05031, 2020
1462020
Learn to navigate: cooperative path planning for unmanned surface vehicles using deep reinforcement learning
X Zhou, P Wu, H Zhang, W Guo, Y Liu
Ieee Access 7, 165262-165278, 2019
1462019
Offline pre-trained multi-agent decision transformer
L Meng, M Wen, C Le, X Li, D Xing, W Zhang, Y Wen, H Zhang, J Wang, ...
Machine Intelligence Research 20 (2), 233-248, 2023
1102023
Bi-level actor-critic for multi-agent coordination
H Zhang, W Chen, Z Huang, M Li, Y Yang, W Zhang, J Wang
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7325-7332, 2020
1042020
Learning correlated communication topology in multi-agent reinforcement learning
Y Du, B Liu, V Moens, Z Liu, Z Ren, J Wang, X Chen, H Zhang
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
762021
Settling the variance of multi-agent policy gradients
JG Kuba, M Wen, L Meng, H Zhang, D Mguni, J Wang, Y Yang
Advances in Neural Information Processing Systems 34, 13458-13470, 2021
712021
User response learning for directly optimizing campaign performance in display advertising
K Ren, W Zhang, Y Rong, H Zhang, Y Yu, J Wang
Proceedings of the 25th acm international on conference on information and …, 2016
542016
GCS: Graph-based coordination strategy for multi-agent reinforcement learning
J Ruan, Y Du, X Xiong, D Xing, X Li, L Meng, H Zhang, J Wang, B Xu
arXiv preprint arXiv:2201.06257, 2022
482022
Token-level direct preference optimization
Y Zeng, G Liu, W Ma, N Yang, H Zhang, J Wang
arXiv preprint arXiv:2404.11999, 2024
462024
Large language models play starcraft ii: Benchmarks and a chain of summarization approach
W Ma, Q Mi, Y Zeng, X Yan, R Lin, Y Wu, J Wang, H Zhang
Advances in Neural Information Processing Systems 37, 133386-133442, 2025
422025
Large sequence models for sequential decision-making: a survey
M Wen, R Lin, H Wang, Y Yang, Y Wen, L Mai, J Wang, H Zhang, ...
Frontiers of Computer Science 17 (6), 176349, 2023
362023
A review: machine learning for combinatorial optimization problems in energy areas
X Yang, Z Wang, H Zhang, N Ma, N Yang, H Liu, H Zhang, L Yang
Algorithms 15 (6), 205, 2022
292022
Botzone: an online multi-agent competitive platform for ai education
H Zhou, H Zhang, Y Zhou, X Wang, W Li
Proceedings of the 23rd Annual ACM Conference on Innovation and Technology …, 2018
282018
Layout design for intelligent warehouse by evolution with fitness approximation
H Zhang, Z Guo, W Zhang, H Cai, C Wang, Y Yu, W Li, J Wang
IEEE Access 7, 166310-166317, 2019
242019
Learning to design games: Strategic environments in reinforcement learning
H Zhang, J Wang, Z Zhou, W Zhang, Y Wen, Y Yu, W Li
Proceedings of the 27th international joint conference on Artificial …, 2017
182017
A game-theoretic approach for improving generalization ability of TSP solvers
C Wang, Y Yang, O Slumbers, C Han, T Guo, H Zhang, J Wang
arXiv preprint arXiv:2110.15105, 2021
172021
Managing risk of bidding in display advertising
H Zhang, W Zhang, Y Rong, K Ren, W Li, J Wang
Proceedings of the Tenth ACM International Conference on Web Search and Data …, 2017
152017
Estimating -Rank from A Few Entries with Low Rank Matrix Completion
Y Du, X Yan, X Chen, J Wang, H Zhang
International Conference on Machine Learning, 2870-2879, 2021
132021
Botzone: A competitive and interactive platform for game AI education
H Zhou, Y Zhou, H Zhang, H Huang, W Li
Proceedings of the ACM turing 50th celebration conference-China, 1-5, 2017
112017
A theoretical understanding of gradient bias in meta-reinforcement learning
B Liu, X Feng, J Ren, L Mai, R Zhu, H Zhang, J Wang, Y Yang
Advances in Neural Information Processing Systems 35, 31059-31072, 2022
102022
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20