Prati
Qinqing Zheng
Qinqing Zheng
Meta
Potvrđena adresa e-pošte na meta.com - Početna stranica
Naslov
Citirano
Citirano
Godina
Online decision transformer
Q Zheng, A Zhang, A Grover
International Conference on Machine Learning 162, 27042--27059, 2022
2602022
A convergent gradient descent algorithm for rank minimization and semidefinite programming from random linear measurements
Q Zheng, J Lafferty
Advances in Neural Information Processing Systems (NeurIPS), 109--117, 2015
2252015
Convergence analysis for rectangular matrix completion using Burer-Monteiro factorization and gradient descent
Q Zheng, J Lafferty
arXiv preprint arXiv:1605.07051, 2016
1902016
Federated f-differential privacy
Q Zheng, S Chen, Q Long, W Su
AISTATS 2021, 2021
752021
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
L Lehnert, S Sukhbaatar, DJ Su, Q Zheng, P Mcvay, M Rabbat, Y Tian
COLM 2024, 2024
372024
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
H Sikchi, Q Zheng, A Zhang, S Niekum
ICLR 2024, 2023
34*2023
Minimax Estimation for Personalized Federated Learning: An Alternative between FedAvg and Local Training?
S Chen, Q Zheng, Q Long, WJ Su
JMLR, 2023
29*2023
Guided flows for generative modeling and decision making
Q Zheng, M Le, N Shaul, Y Lipman, A Grover, RTQ Chen
arXiv preprint arXiv:2311.13443, 2023
232023
Diffusion world model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Z Ding, A Zhang, Y Tian, Q Zheng
arXiv preprint arXiv:2402.03570, 2024
21*2024
Semi-supervised offline reinforcement learning with action-free trajectories
Q Zheng, M Henaff, B Amos, A Grover
ICML 2023, 2023
212023
Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion
Q Zheng, J Dong, Q Long, WJ Su
ICML 2020, 2020
212020
Interpolating convex and non-convex tensor decompositions via the subspace norm
Q Zheng, R Tomioka
NeurIPS 2015, 2015
162015
Near-Optimal Confidence Sequences for Bounded Random Variables
AK Kuchibhotla, Q Zheng
ICML 2021, 2021
112021
Latent state marginalization as a low-cost approach for improving exploration
D Zhang, A Courville, Y Bengio, Q Zheng, A Zhang, RTQ Chen
ICLR 2023, 2022
102022
Reliable conditioning of behavioral cloning for offline reinforcement learning
T Nguyen, Q Zheng, A Grover
arXiv preprint arXiv:2210.05158, 2022
9*2022
Dualformer: Controllable fast and slow thinking by learning with randomized reasoning traces
DJ Su, S Sukhbaatar, M Rabbat, Y Tian, Q Zheng
ICLR 2025, 2024
72024
Shadowsync: Performing synchronization in the background for highly scalable distributed training
Q Zheng, BY Su, J Yang, A Azzolini, Q Wu, O Jin, S Karandikar, ...
arXiv preprint arXiv:2003.03477, 2020
72020
Performing Synchronization in the Background for Highly Scalable Distributed Training
Q Zheng, SU Bor-Yiing, J Yang, AG Azzolini, Q Wu, O Jin
US Patent App. 16/989,131, 2022
22022
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback
Q Zheng, M Henaff, A Zhang, A Grover, B Amos
arXiv preprint arXiv:2410.23022, 2024
12024
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
DJ Su, H Zhu, Y Xu, J Jiao, Y Tian, Q Zheng
arXiv preprint arXiv:2502.03275, 2025
2025
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–20