Qi Cai

Dikutip oleh

	Semua	Sejak 2020
Kutipan	1224	1205
indeks-h	11	11
indeks-i10	12	12

300

150

225

201920202021202220232024202517 131 241 265 256 292 20

Akses publik

Lihat semua

1 artikel

0 artikel

tersedia

tidak tersedia

Berdasarkan pada mandat pendanaan

Ikuti

Qi Cai

Northwestern University

Email yang diverifikasi di u.northwestern.edu

reinforcement learning optimization machine learning


Judul Urutkan menurut kutipan Urutkan menurut tahun Urutkan menurut judul	Dikutip oleh Dikutip oleh	Tahun
Provably efficient exploration in policy optimization Q Cai, Z Yang, C Jin, Z Wang International Conference on Machine Learning, 1283-1294, 2020	328	2020
Neural policy gradient methods: Global optimality and rates of convergence L Wang, Q Cai, Z Yang, Z Wang International Conference on Learning Representations 2020, 2019	271	2019
Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy B Liu, Q Cai, Z Yang, Z Wang Advances in Neural Information Processing Systems, 10564-10575, 2019	232	2019
Neural temporal-difference learning converges to global optima Q Cai, Z Yang, JD Lee, Z Wang Advances in Neural Information Processing Systems 32, 2019	154*	2019
On the Global Optimality of Model-Agnostic Meta-Learning: Reinforcement Learning and Supervised Learning L Wang, Q Cai, Z Yang, Z Wang International Conference on Machine Learning, 9837-9846, 2020	54	2020
On the global convergence of imitation learning: A case for linear quadratic regulator Q Cai, M Hong, Y Chen, Z Wang arXiv preprint arXiv:1901.03674, 2019	40	2019
Generative adversarial imitation learning with neural network parameterization: Global optimality and convergence rate Y Zhang, Q Cai, Z Yang, Z Wang International conference on machine learning, 11044-11054, 2020	34*	2020
Reinforcement learning from partial observation: Linear function approximation with provable sample efficiency Q Cai, Z Yang, Z Wang International Conference on Machine Learning, 2485-2522, 2022	29*	2022
Represent to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency L Wang, Q Cai, Z Yang, Z Wang The Eleventh International Conference on Learning Representations, 0	28*
Provably efficient offline reinforcement learning for partially observable markov decision processes H Guo, Q Cai, Y Zhang, Z Yang, Z Wang International Conference on Machine Learning, 8016-8038, 2022	17	2022
Can Temporal-Diﬀerence and Q-Learning Learn Representation? A Mean-Field Theory Y Zhang, Q Cai, Z Yang, Y Chen, Z Wang Advances in Neural Information Processing Systems 33, 19680-19692, 2020	15	2020
Neural temporal difference and q learning provably converge to global optima Q Cai, Z Yang, JD Lee, Z Wang Mathematics of Operations Research 49 (1), 619-651, 2024	11	2024
An analysis of attention via the lens of exchangeability and latent variable models Y Zhang, B Liu, Q Cai, L Wang, Z Wang arXiv preprint arXiv:2212.14852, 2022	9	2022
Optimistic Policy Optimization with General Function Approximations Q Cai, Z Yang, C Szepesvari, Z Wang	2	2021
Provably Efficient Reinforcement Learning Q Cai Northwestern University, 2022		2022
BooVI: provably efficient bootstrapped value iteration B Liu, Q Cai, Z Yang, Z Wang Advances in Neural Information Processing Systems 34, 7041-7053, 2021		2021

Sistem tidak dapat melakukan operasi ini. Coba lagi nanti.

Artikel 1–16

Kutipan per tahun

Kutipan duplikat

Kutipan yang digabung

Tambahkan pengarang bersamaPengarang bersama

Ikuti

Dikutip oleh