Sledovať
Shangtong Zhang
Shangtong Zhang
Overená e-mailová adresa na: virginia.edu - Domovská stránka
Názov
Citované v
Citované v
Rok
A Deeper Look at Experience Replay
S Zhang, RS Sutton
Deep Reinforcement Learning Symposium, NIPS 2017, 2017
3952017
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
S Zhang, B Liu, S Whiteson
ICML 2020, 2020
1152020
Distributional Reinforcement Learning for Efficient Exploration
B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu
ICML 2019, 2019
1112019
DAC: The Double Actor-Critic Architecture for Learning Options
S Zhang, S Whiteson
NeurIPS 2019, 2019
972019
mlpack 3: a fast, flexible machine learning library
R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang
Journal of Open Source Software 3 (26), 726, 2018
902018
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation
S Zhang, B Liu, H Yao, S Whiteson
ICML 2020, 2019
652019
Generalized Off-Policy Actor-Critic
S Zhang, W Boehmer, S Whiteson
NeurIPS 2019, 2019
592019
Breaking the Deadly Triad with a Target Network
S Zhang, H Yao, S Whiteson
ICML 2021, 2021
582021
Mean-variance policy iteration for risk-averse reinforcement learning
S Zhang, B Liu, S Whiteson
Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10905 …, 2021
522021
Average-Reward Off-Policy Policy Evaluation with Function Approximation
S Zhang, Y Wan, RS Sutton, S Whiteson
ICML 2021, 2021
442021
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ...
arXiv preprint arXiv:2308.03526, 2023
39*2023
QUOTA: The Quantile Option Architecture for Reinforcement Learning
S Zhang, B Mavrin, L Kong, B Liu, H Yao
AAAI 2019, 2018
352018
Deep Residual Reinforcement Learning
S Zhang, W Boehmer, S Whiteson
AAMAS 2020, 2019
342019
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search
S Zhang, H Chen, H Yao
AAAI 2019, 2018
322018
Modularized Implementation of Deep RL Algorithms in PyTorch
S Zhang
31*2018
A deep neural network for modeling music
P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang
Proceedings of the 5th ACM on International Conference on Multimedia …, 2015
302015
On the Convergence of SARSA with Linear Function Approximation
S Zhang, RT Des Combes, R Laroche
ICML 2023, 2023
17*2023
Learning expected emphatic traces for deep RL
R Jiang, S Zhang, V Chelu, A White, H van Hasselt
Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 7015-7023, 2022
162022
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
S Zhang, R Tachet, R Laroche
Journal of Machine Learning Research, 2021
162021
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards
Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu
AAAI 2020, 2019
162019
Systém momentálne nemôže vykonať operáciu. Skúste to neskôr.
Články 1–20