Követés
Yulai Zhao
Yulai Zhao
E-mail megerősítve itt: princeton.edu - Kezdőlap
Cím
Hivatkozott rá
Hivatkozott rá
Év
Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games
Y Zhao, Y Tian, J Lee, S Du
International Conference on Artificial Intelligence and Statistics (AISTATS …, 2022
722022
Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
M Uehara*, Y Zhao*, K Black, E Hajiramezanali, G Scalia, NL Diamant, ...
arXiv preprint arXiv:2402.15194, 2024
402024
Feedback Efficient Online Fine-Tuning of Diffusion Models
M Uehara*, Y Zhao*, K Black, E Hajiramezanali, G Scalia, NL Diamant, ...
International Conference on Machine Learning (ICML), 48892-48918, 2024
242024
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review
M Uehara*, Y Zhao*, T Biancalani, S Levine
arXiv preprint arXiv:2407.13734, 2024
232024
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
M Uehara*, Y Zhao*, E Hajiramezanali, G Scalia, G Eraslan, A Lal, ...
Conference on Neural Information Processing Systems (NeurIPS), 2024
132024
Derivative-free guidance in continuous and discrete diffusion models with soft value-based decoding
X Li, Y Zhao, C Wang, G Scalia, G Eraslan, S Nair, T Biancalani, S Ji, ...
arXiv preprint arXiv:2408.08252, 2024
122024
Optimizing the Performative Risk under Weak Convexity Assumptions
Y Zhao
OPT 2022: Optimization for Machine Learning (NeurIPS 2022 Workshop), 2022
102022
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Y Zhao, Z Yang, Z Wang, JD Lee
International Conference on Machine Learning (ICML), 42200-42226, 2023
62023
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Y Zhao*, M Uehara*, G Scalia, S Kung, T Biancalani, S Levine, ...
International Conference on Learning Representations (ICLR), 2024
52024
Provably Efficient CVaR RL in Low-rank MDPs
Y Zhao*, W Zhan*, X Hu*, H Leung, F Farnia, W Sun, JD Lee
International Conference on Learning Representations (ICLR), 2024
32024
Blessing of Class Diversity in Pre-training
Y Zhao, J Chen, SS Du
International Conference on Artificial Intelligence and Statistics (AISTATS …, 2023
32023
Inference-Time Alignment in Diffusion Models with Reward-Guided Generation: Tutorial and Review
M Uehara, Y Zhao, C Wang, X Li, A Regev, S Levine, T Biancalani
arXiv preprint arXiv:2501.09685, 2025
2*2025
Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design
M Uehara, X Su, Y Zhao, X Li, A Regev, S Ji, S Levine, T Biancalani
arXiv preprint arXiv:2502.14944, 2025
2025
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–13