Shangtong Zhang

Cited by

	All	Since 2020
Citations	1450	1346
h-index	17	17
i10-index	25	24

Citations per year

Year	Citations
2017	4
2018	27
2019	65
2020	161
2021	229
2022	292
2023	318
2024	294
2025	52

Public access

11 articles available
0 articles not available

Based on funding mandates

Co-authors

Name	Affiliation	Verified email at
Shimon Whiteson	Professor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo	cs.ox.ac.uk
Richard S. Sutton	Keen, Amii, and University of Alberta	richsutton.com
Bo Liu	PhD, AAAI SM, IEEE SM	cs.umass.edu
Remi Tachet des Combes		alpacaml.com
Romain Laroche	Wayve	polytechnique.org
Linglong Kong	Professor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, Amii	ualberta.ca
Ray Jiang	Research Scientist, DeepMind	google.com
Wendelin Böhmer	Sequential Decision Making Group, Delft University of Technology	tudelft.nl
Marcus Edel	Computer Science, Free University of Berlin	fu-berlin.de
Ryan R. Curtin	Free agent	ratml.org
Nando de Freitas	CIFAR & DeepMind	google.com
Tom Le Paine	Staff Research Scientist at Google DeepMind	google.com
Julian Schrittwieser	DeepMind	furidamu.org
Roman Ring	Google DeepMind	deepmind.com
Petko Georgiev	Google DeepMind, University of Cambridge	cam.ac.uk
Michael Mathieu	DeepMind	google.com
Aäron van den Oord	Google DeepMind	google.com
Caglar Gulcehre	AI Researcher, Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMind	google.com
Aja Huang	DeepMind	google.com
Sherjil Ozair	Founder and CEO, General Agents	generalagents.com


Shangtong Zhang

University of Virginia

Verified email at virginia.edu

reinforcement learning, stochastic approximation


Title	Authors	Venue	Cited by	Year
A Deeper Look at Experience Replay	S Zhang, RS Sutton	Deep Reinforcement Learning Symposium, NIPS 2017, 2017	395	2017
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values	S Zhang, B Liu, S Whiteson	ICML 2020, 2020	115	2020
Distributional Reinforcement Learning for Efficient Exploration	B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu	ICML 2019, 2019	111	2019
DAC: The Double Actor-Critic Architecture for Learning Options	S Zhang, S Whiteson	NeurIPS 2019, 2019	97	2019
mlpack 3: a fast, flexible machine learning library	R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang	Journal of Open Source Software 3 (26), 726, 2018	90	2018
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation	S Zhang, B Liu, H Yao, S Whiteson	ICML 2020, 2019	65	2019
Generalized Off-Policy Actor-Critic	S Zhang, W Boehmer, S Whiteson	NeurIPS 2019, 2019	59	2019
Breaking the Deadly Triad with a Target Network	S Zhang, H Yao, S Whiteson	ICML 2021, 2021	58	2021
Mean-variance policy iteration for risk-averse reinforcement learning	S Zhang, B Liu, S Whiteson	Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10905 …, 2021	52	2021
Average-Reward Off-Policy Policy Evaluation with Function Approximation	S Zhang, Y Wan, RS Sutton, S Whiteson	ICML 2021, 2021	44	2021
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning	M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ...	arXiv preprint arXiv:2308.03526, 2023	39*	2023
QUOTA: The Quantile Option Architecture for Reinforcement Learning	S Zhang, B Mavrin, L Kong, B Liu, H Yao	AAAI 2019, 2018	35	2018
Deep Residual Reinforcement Learning	S Zhang, W Boehmer, S Whiteson	AAMAS 2020, 2019	34	2019
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search	S Zhang, H Chen, H Yao	AAAI 2019, 2018	32	2018
Modularized Implementation of Deep RL Algorithms in PyTorch	S Zhang		31*	2018
A deep neural network for modeling music	P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang	Proceedings of the 5th ACM on International Conference on Multimedia …, 2015	30	2015
On the Convergence of SARSA with Linear Function Approximation	S Zhang, RT Des Combes, R Laroche	ICML 2023, 2023	17*	2023
Learning expected emphatic traces for deep RL	R Jiang, S Zhang, V Chelu, A White, H van Hasselt	Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 7015-7023, 2022	16	2022
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch	S Zhang, R Tachet, R Laroche	Journal of Machine Learning Research, 2021	16	2021
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards	Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu	AAAI 2020, 2019	16	2019
