Segui
Zuyue Fu
Titolo
Citata da
Citata da
Anno
Actor-critic provably finds Nash equilibria of linear-quadratic mean-field games
Z Fu, Z Yang, Y Chen, Z Wang
International Conference on Learning Representations, 2019
702019
Instrumental variable value iteration for causal offline reinforcement learning
L Liao, Z Fu, Z Yang, Y Wang, D Ma, M Kolar, Z Wang
Journal of Machine Learning Research 25 (303), 1-56, 2024
502024
Single-timescale actor-critic provably finds globally optimal policy
Z Fu, Z Yang, Z Wang
International Conference on Learning Representations, 2020
482020
Learning from demonstration: Provably efficient adversarial policy imitation with linear function approximation
Z Liu, Y Zhang, Z Fu, Z Yang, Z Wang
International conference on machine learning, 14094-14138, 2022
25*2022
Offline reinforcement learning with instrumental variables in confounded markov decision processes
Z Fu, Z Qi, Z Wang, Z Yang, Y Xu, MR Kosorok
arXiv preprint arXiv:2209.08666, 2022
242022
False correlation reduction for offline reinforcement learning
Z Deng, Z Fu, L Wang, Z Yang, C Bai, T Zhou, Z Wang, J Jiang
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
17*2023
Decentralized single-timescale actor-critic on zero-sum two-player stochastic games
H Guo, Z Fu, Z Yang, Z Wang
International Conference on Machine Learning, 3899-3909, 2021
112021
Sample elicitation
J Wei, Z Fu, Y Liu, X Li, Z Yang, Z Wang
International Conference on Artificial Intelligence and Statistics, 2692-2700, 2021
92021
Convergent reinforcement learning with function approximation: A bilevel optimization perspective
Z Yang, Z Fu, K Zhang, Z Wang
72018
Optimistic exploration with learned features provably solves markov decision processes with neural dynamics
S Zheng, L Wang, S Qiu, Z Fu, Z Yang, C Szepesvari, Z Wang
The Eleventh International Conference on Learning Representations, 2022
32022
A two-fold structural classification method for determining the accurate ensemble of protein structures
P Tan, Z Fu, L Petridis, S Qian, D You, D Wei, J Li, L Hong
Communications in Computational Physics 25 (4), 2018
12018
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Z Fu, Z Qi, Z Yang, Z Wang, L Wang
arXiv preprint arXiv:2212.12167, 2022
2022
On the Optimality and Complexity of Reinforcement Learning
Z Fu
Northwestern University, 2022
2022
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–13