Haifeng Zhang

Cited by

	All	Since 2020
Citations	1153	1094
h-index	16	15
i10-index	21	19

Citations per year

Year	Citations
2015	4
2016	3
2017	13
2018	10
2019	27
2020	30
2021	70
2022	190
2023	296
2024	423
2025	84

Public access

14 articles

3 articles

available

not available

Based on funding mandates

Co-authors

Jun Wang, Professor, Computer Science, University College London. Verified email at cs.ucl.ac.uk
Weinan Zhang, Professor, Shanghai Jiao Tong University. Verified email at sjtu.edu.cn
Yong Yu (俞勇), Professor, Shanghai Jiao Tong University. Verified email at sjtu.edu.cn
Ying Wen, Associate Professor, Shanghai Jiao Tong University. Verified email at sjtu.edu.cn
Kan Ren, Assistant Professor, ShanghaiTech University. Verified email at shanghaitech.edu.cn
Zhiming Zhou, Shanghai University of Finance and Economics. Verified email at mail.shufe.edu.cn

Haifeng Zhang

Institute of Automation, Chinese Academy of Sciences

Verified email at ia.ac.cn

reinforcement learning, computational advertising


Title	Cited by	Year
Improving knowledge tracing via pre-training question embeddings. Y Liu, Y Yang, X Chen, J Shen, H Zhang, Y Yu. arXiv preprint arXiv:2012.05031, 2020	146	2020
Learn to navigate: cooperative path planning for unmanned surface vehicles using deep reinforcement learning. X Zhou, P Wu, H Zhang, W Guo, Y Liu. IEEE Access 7, 165262-165278, 2019	146	2019
Offline pre-trained multi-agent decision transformer. L Meng, M Wen, C Le, X Li, D Xing, W Zhang, Y Wen, H Zhang, J Wang, ... Machine Intelligence Research 20 (2), 233-248, 2023	110	2023
Bi-level actor-critic for multi-agent coordination. H Zhang, W Chen, Z Huang, M Li, Y Yang, W Zhang, J Wang. Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7325-7332, 2020	104	2020
Learning correlated communication topology in multi-agent reinforcement learning. Y Du, B Liu, V Moens, Z Liu, Z Ren, J Wang, X Chen, H Zhang. Proceedings of the 20th International Conference on Autonomous Agents and …, 2021	76	2021
Settling the variance of multi-agent policy gradients. JG Kuba, M Wen, L Meng, H Zhang, D Mguni, J Wang, Y Yang. Advances in Neural Information Processing Systems 34, 13458-13470, 2021	71	2021
User response learning for directly optimizing campaign performance in display advertising. K Ren, W Zhang, Y Rong, H Zhang, Y Yu, J Wang. Proceedings of the 25th ACM International on Conference on Information and …, 2016	54	2016
GCS: Graph-based coordination strategy for multi-agent reinforcement learning. J Ruan, Y Du, X Xiong, D Xing, X Li, L Meng, H Zhang, J Wang, B Xu. arXiv preprint arXiv:2201.06257, 2022	48	2022
Token-level direct preference optimization. Y Zeng, G Liu, W Ma, N Yang, H Zhang, J Wang. arXiv preprint arXiv:2404.11999, 2024	46	2024
Large language models play StarCraft II: Benchmarks and a chain of summarization approach. W Ma, Q Mi, Y Zeng, X Yan, R Lin, Y Wu, J Wang, H Zhang. Advances in Neural Information Processing Systems 37, 133386-133442, 2025	42	2025
Large sequence models for sequential decision-making: a survey. M Wen, R Lin, H Wang, Y Yang, Y Wen, L Mai, J Wang, H Zhang, ... Frontiers of Computer Science 17 (6), 176349, 2023	36	2023
A review: machine learning for combinatorial optimization problems in energy areas. X Yang, Z Wang, H Zhang, N Ma, N Yang, H Liu, H Zhang, L Yang. Algorithms 15 (6), 205, 2022	29	2022
Botzone: an online multi-agent competitive platform for AI education. H Zhou, H Zhang, Y Zhou, X Wang, W Li. Proceedings of the 23rd Annual ACM Conference on Innovation and Technology …, 2018	28	2018
Layout design for intelligent warehouse by evolution with fitness approximation. H Zhang, Z Guo, W Zhang, H Cai, C Wang, Y Yu, W Li, J Wang. IEEE Access 7, 166310-166317, 2019	24	2019
Learning to design games: Strategic environments in reinforcement learning. H Zhang, J Wang, Z Zhou, W Zhang, Y Wen, Y Yu, W Li. Proceedings of the 27th International Joint Conference on Artificial …, 2017	18	2017
A game-theoretic approach for improving generalization ability of TSP solvers. C Wang, Y Yang, O Slumbers, C Han, T Guo, H Zhang, J Wang. arXiv preprint arXiv:2110.15105, 2021	17	2021
Managing risk of bidding in display advertising. H Zhang, W Zhang, Y Rong, K Ren, W Li, J Wang. Proceedings of the Tenth ACM International Conference on Web Search and Data …, 2017	15	2017
Estimating α-Rank from A Few Entries with Low Rank Matrix Completion. Y Du, X Yan, X Chen, J Wang, H Zhang. International Conference on Machine Learning, 2870-2879, 2021	13	2021
Botzone: A competitive and interactive platform for game AI education. H Zhou, Y Zhou, H Zhang, H Huang, W Li. Proceedings of the ACM Turing 50th Celebration Conference - China, 1-5, 2017	11	2017
A theoretical understanding of gradient bias in meta-reinforcement learning. B Liu, X Feng, J Ren, L Mai, R Zhu, H Zhang, J Wang, Y Yang. Advances in Neural Information Processing Systems 35, 31059-31072, 2022	10	2022
