Seuraa
Runzhe Wan
Runzhe Wan
Vahvistettu sähköpostiosoite verkkotunnuksessa amazon.com - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Batch policy learning in average reward markov decision processes
P Liao, Z Qi, R Wan, P Klasnja, S Murphy
arXiv preprint arXiv:2007.11771, 2022
952022
Does the Markov decision process fit the data: testing for the Markov property in sequential decision making
C Shi, R Wan, R Song, W Lu, L Leng
International Conference on Machine Learning, 8807-8817, 2020
492020
Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control
R Wan, X Zhang, R Song
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021
412021
Deeply-debiased off-policy interval estimation
C Shi, R Wan, V Chernozhukov, R Song
International Conference on Machine Learning, 9580-9591, 2021
392021
Metadata-based multi-task bandits with bayesian hierarchical models
R Wan, L Ge, R Song
Advances in Neural Information Processing Systems 34, 29655-29668, 2021
322021
A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets
C Shi, R Wan, G Song, S Luo, H Zhu, R Song
The Annals of Applied Statistics 17 (4), 2701-2722, 2023
182023
Towards scalable and robust structured bandits: A meta-learning framework
R Wan, L Ge, R Song
International Conference on Artificial Intelligence and Statistics, 1144-1173, 2023
152023
Safe Exploration for Efficient Policy Evaluation and Comparison
R Wan, B Kveton, R Song
International Conference on Machine Learning, 22491-22511, 2022
152022
Mining the factor zoo: Estimation of latent factor models with sufficient proxies
R Wan, Y Li, W Lu, R Song
Journal of Econometrics 239 (2), 105386, 2024
72024
STEEL: Singularity-aware Reinforcement Learning
X Chen, Z Qi, R Wan
arXiv preprint arXiv:2301.13152, 2023
72023
Robust offline reinforcement learning with heavy-tailed rewards
J Zhu, R Wan, Z Qi, S Luo, C Shi
International Conference on Artificial Intelligence and Statistics, 541-549, 2024
5*2024
Experimentation platforms meet reinforcement learning: Bayesian sequential decision-making for continuous monitoring
R Wan, Y Liu, J McQueen, D Hains, R Song
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023
52023
Multiplier bootstrap-based exploration
R Wan, H Wei, B Kveton, R Song
International Conference on Machine Learning, 35444-35490, 2023
42023
Pattern Transfer Learning for Reinforcement Learning in Order Dispatching
R Wan, S Zhang, C Shi, S Luo, R Song
arXiv preprint arXiv:2105.13218, 2021
32021
A Review of Reinforcement Learning in Financial Applications
Y Bai, Y Gao, R Wan, S Zhang, R Song
Annual Review of Statistics and Its Application 12, 2024
12024
Effect size estimation for duration recommendation in online experiments: Leveraging hierarchical models and objective utility approaches
Y Liu, R Wan, J McQueen, D Hains, J Gu, R Song
Proceedings of the AAAI Conference on Artificial Intelligence 38 (12), 14044 …, 2024
12024
Zero-Inflated Bandits
H Wei, R Wan, L Shi, R Song
arXiv preprint arXiv:2312.15595, 2023
12023
Advances in Statistical Inference and Policy Optimization for Reinforcement Learning
R Wan
North Carolina State University, 2022
12022
Know when to fold: Futility-aware early termination in online experiments
Y Liu, R Wan, Y Huang, J McQueen, D Hains, J Gu, R Song
2025
Online testing efficiency through early termination
Y Liu, R Wan, J McQueen, D Hains, R Song, RH Castillo
US Patent 11,909,829, 2024
2024
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20