Qi Cai

Citirano

	Sve	Od 2020.
Citati	1218	1197
H-indeks	11	11
i10-indeks	12	12

280

140

210

201920202021202220232024202519 130 238 268 277 242 42

Javni pristup

Prikaži sve

1 članak

0 članaka

dostupno

nije dostupno

Na temelju uvjeta financiranja

Prati

Qi Cai

Northwestern University

Potvrđena adresa e-pošte na u.northwestern.edu

reinforcement learning optimization machine learning


Naslov Poredaj po navodima Poredaj po godini Poredaj po naslovu	Citirano Citirano	Godina
Provably efficient exploration in policy optimization Q Cai, Z Yang, C Jin, Z Wang International Conference on Machine Learning, 1283-1294, 2020	324	2020
Neural policy gradient methods: Global optimality and rates of convergence L Wang, Q Cai, Z Yang, Z Wang International Conference on Learning Representations 2020, 2019	270	2019
Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy B Liu, Q Cai, Z Yang, Z Wang Advances in Neural Information Processing Systems, 10564-10575, 2019	236	2019
Neural temporal-difference learning converges to global optima Q Cai, Z Yang, JD Lee, Z Wang Advances in Neural Information Processing Systems 32, 2019	157	2019
On the Global Optimality of Model-Agnostic Meta-Learning: Reinforcement Learning and Supervised Learning L Wang, Q Cai, Z Yang, Z Wang International Conference on Machine Learning, 9837-9846, 2020	52	2020
On the global convergence of imitation learning: A case for linear quadratic regulator Q Cai, M Hong, Y Chen, Z Wang arXiv preprint arXiv:1901.03674, 2019	39	2019
Generative adversarial imitation learning with neural network parameterization: Global optimality and convergence rate Y Zhang, Q Cai, Z Yang, Z Wang International conference on machine learning, 11044-11054, 2020	35*	2020
Reinforcement learning from partial observation: Linear function approximation with provable sample efficiency Q Cai, Z Yang, Z Wang International Conference on Machine Learning, 2485-2522, 2022	33*	2022
Represent to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency L Wang, Q Cai, Z Yang, Z Wang The Eleventh International Conference on Learning Representations, 0	29*
Provably efficient offline reinforcement learning for partially observable Markov decision processes H Guo, Q Cai, Y Zhang, Z Yang, Z Wang International Conference on Machine Learning, 8016-8038, 2022	18	2022
Can Temporal-Diﬀerence and Q-Learning Learn Representation? A Mean-Field Theory Y Zhang, Q Cai, Z Yang, Y Chen, Z Wang Advances in Neural Information Processing Systems 33, 19680-19692, 2020	13	2020
An analysis of attention via the lens of exchangeability and latent variable models Y Zhang, B Liu, Q Cai, L Wang, Z Wang arXiv preprint arXiv:2212.14852, 2022	11	2022
Optimistic Policy Optimization with General Function Approximations Q Cai, Z Yang, C Szepesvari, Z Wang	1	2021
Provably Efficient Reinforcement Learning Q Cai Northwestern University, 2022		2022
BooVI: provably efficient bootstrapped value iteration B Liu, Q Cai, Z Yang, Z Wang Advances in Neural Information Processing Systems 34, 7041-7053, 2021		2021

Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.

Članci 1–15

Godišnji broj citata

Dvostruki navodi

Spojeni navodi

Dodavanje suautoraSuautori

Prati

Citirano