Hongming Zhang

Citeret af

	Alle	Siden 2020
Henvisninger	493	493
h-index	6	6
i10-indeks	5	5

180

135

2020202120222023202420256 53 86 137 176 30

Offentlig adgang

Se alle

3 artikler

0 artikler

tilgængelige

ikke tilgængelige

Baseret på krav i forbindelse med finansiering

Medforfattere

Hao DongAssistant Professor at Peking UniversityVerificeret mail på pku.edu.cn
Zihan DingPrinceton UniversityVerificeret mail på princeton.edu
Fengshuo BaiShanghai Jiao Tong UniversityVerificeret mail på sjtu.edu.cn
Martin MüllerProfessor, Computing Science, University of AlbertaVerificeret mail på ualberta.ca
Jun JinAssistant Professor at the University of AlbertaVerificeret mail på ualberta.ca
Ke Sun (孙科)Postdoc, Harvard UniversityVerificeret mail på fas.harvard.edu
Linglong KongProfessor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, AmiiVerificeret mail på ualberta.ca
Chao GaoHuawei Canada Research CenterVerificeret mail på huawei.com
Dale SchuurmansUniversity of Alberta, Google DeepMindVerificeret mail på cs.ualberta.ca
Tongzheng RenCitadel SecuritiesVerificeret mail på utexas.edu
Bo DaiGoogle Brain & Georgia TechVerificeret mail på google.com

Følg

Hongming Zhang

University of Alberta

Verificeret mail på ualberta.ca - Startside

reinforcement learning tree search statistical machine learning


Titel Sortér efter henvisninger Sortér efter årstal Sortér efter titel	Citeret af Citeret af	År
Deep Reinforcement Learning: Fundamentals, Research, and Applications H Dong, Z Ding, S Zhang, H Yuan, H Zhang, J Zhang, Y Huang, T Yu, ... Springer Singapore, 2020	311	2020
Taxonomy of reinforcement learning algorithms H Zhang, T Yu Deep reinforcement learning: Fundamentals, research and applications, 125-133, 2020	99	2020
AlphaZero H Zhang, T Yu Deep Reinforcement Learning: Fundamentals, Research and Applications, 391-415, 2020	27	2020
Efficient reinforcement learning development with rlzoo Z Ding, T Yu, H Zhang, Y Huang, G Li, Q Guo, L Mai, H Dong Proceedings of the 29th ACM International Conference on Multimedia, 3759-3762, 2021	18*	2021
Picor: Multi-task deep reinforcement learning with policy correction F Bai, H Zhang, T Tao, Z Wu, Y Wang, B Xu Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 6728-6736, 2023	16	2023
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay H Zhang, C Xiao, H Wang, J Jin, B Xu, M Müller The Eleventh International Conference on Learning Representations, 2023	9	2023
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning H Zhang, T Ren, C Xiao, D Schuurmans, B Dai Forty-first International Conference on Machine Learning, 2024	5	2024
Combine Deep Q-Networks with Actor-Critic H Zhang, T Yu, R Huang Deep Reinforcement Learning: Fundamentals, Research and Applications, 213-245, 2020	4	2020
A Distance-based Anomaly Detection Framework for Deep Reinforcement Learning H Zhang, K Sun, B Xu, L Kong, M Müller Transactions on Machine Learning Research, 2024	2*	2024
A logarithmic barrier method for proximal policy optimization C Zeng, H Zhang arXiv preprint arXiv:1812.06502, 2018	2	2018
Latent Landmark Graph for Efficient Exploration-exploitation Balance in Hierarchical Reinforcement Learning Q Zhang, H Zhang, D Xing, B Xu Machine Intelligence Research, 1-22, 2025		2025
-DQN: Improving Deep Q-Learning By Evolving the Behavior H Zhang, F Bai, C Xiao, C Gao, B Xu, M Müller The 24th International Conference on Autonomous Agents and Multiagent …, 2025		2025
Monte Carlo Tree Search in the Presence of Transition Uncertainty F Kohankhaki, K Aghakasiri, H Zhang, TH Wei, C Gao, M Müller Proceedings of the AAAI Conference on Artificial Intelligence 38 (18), 20151 …, 2024		2024
Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration H Zhang, C Xiao, C Gao, H Wang, X Bo, M Müller The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024		2024
Build generally reusable agent-environment interaction models J Jin, H Zhang, J Luo arXiv preprint arXiv:2211.08234, 2022		2022
Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning J Long, H Zhang, T Yu, B Xu arXiv preprint arXiv:1908.06758, 2019		2019
RevCuT Tree Search Method in Complex Single-player Game with Continuous Search Space H Zhang, F Cheng, B Xu, F Chen, J Liu, W Wu 2019 International Joint Conference on Neural Networks (IJCNN), 1-8, 2019		2019

Systemet kan ikke foretage handlingen nu. Prøv igen senere.

Artikler 1–17

Henvisninger pr. år

Dublerede henvisninger

Flettede henvisninger

Tilføj medforfattereMedforfattere

Følg

Citeret af

Medforfattere