Yue Wang
Title
Cited by
Year
Online Robust Reinforcement Learning with Model Uncertainty
Y Wang, S Zou
Advances in Neural Information Processing Systems 34, 2021
Cited by: 115; Year: 2021
Policy gradient method for robust reinforcement learning
Y Wang, S Zou
International Conference on Machine Learning, 23484-23526, 2022
Cited by: 76; Year: 2022
A Robust and Constrained Multi-Agent Reinforcement Learning Electric Vehicle Rebalancing Method in AMoD Systems
S He, Y Wang, S Han, S Zou, F Miao
2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023
Cited by: 42; Year: 2023
Finite-sample analysis of Greedy-GQ with linear function approximation under Markovian noise
Y Wang, S Zou
Conference on Uncertainty in Artificial Intelligence, 11-20, 2020
Cited by: 28; Year: 2020
Provably efficient offline reinforcement learning with trajectory-wise reward
T Xu, Y Wang, S Zou, Y Liang
IEEE Transactions on Information Theory, 2024
Cited by: 20; Year: 2024
Non-asymptotic analysis for two time-scale TDC with general smooth function approximation
Y Wang, S Zou, Y Zhou
Advances in Neural Information Processing Systems 34, 9747-9758, 2021
Cited by: 20*; Year: 2021
Robust constrained reinforcement learning
Y Wang, F Miao, S Zou
arXiv preprint arXiv:2209.06866, 2022
Cited by: 14; Year: 2022
Robust average-reward Markov decision processes
Y Wang, A Velasquez, G Atia, A Prater-Bennette, S Zou
AAAI 2023, 2023
Cited by: 10; Year: 2023
Model-free robust average-reward reinforcement learning
Y Wang, A Velasquez, GK Atia, A Prater-Bennette, S Zou
International Conference on Machine Learning, 36431-36469, 2023
Cited by: 9; Year: 2023
Robust Average-Reward Reinforcement Learning
Y Wang, A Velasquez, G Atia, A Prater-Bennette, S Zou
Journal of Artificial Intelligence Research 80, 719-803, 2024
Cited by: 2; Year: 2024
Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation
Y Wang, Y Wang, Y Zhou, S Zou
ICML 2024, 2024
Cited by: 2; Year: 2024
Achieving the Asymptotically Optimal Sample Complexity of Offline Reinforcement Learning: A DRO-Based Approach
Y Wang, J Xiong, S Zou
Transactions on Machine Learning Research, 2024
Cited by: 2*; Year: 2024
Finite-time error bounds for Greedy-GQ
Y Wang, Y Zhou, S Zou
Machine Learning 113 (9), 5981-6018, 2024
Cited by: 1; Year: 2024
Data-driven robust multi-agent reinforcement learning
Y Wang, Y Wang, Y Zhou, A Velasquez, S Zou
2022 IEEE 32nd International Workshop on Machine Learning for Signal …, 2022
Cited by: 1; Year: 2022
A Unified Principle of Pessimism for Offline Reinforcement Learning under Model Mismatch
Y Wang, Z Sun, S Zou
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
Year: 2024
Model-Free Robust Reinforcement Learning with Sample Complexity Analysis
Y Wang, S Zou, Y Wang
The 40th Conference on Uncertainty in Artificial Intelligence, 2024
Year: 2024
Articles 1–16