Yulai Zhao

Hivatkozott rá

	Összes	2020 óta
Hivatkozások	213	213
h-index	7	7
i10-index	7	7

100

2021202220232024202513 21 29 96 54

Nyilvános hozzáférés

Összes megtekintése

3 cikk

0 cikk

elérhető

nem érhető el

Finanszírozási megbízások alapján

Társszerzők

Tommaso BiancalaniGenentechE-mail megerősítve itt: gene.com
Sergey LevineUC Berkeley, Physical IntelligenceE-mail megerősítve itt: eecs.berkeley.edu
Masatoshi UeharaEvolutionaryScaleE-mail megerősítve itt: evolutionaryscale.ai
Gabriele ScaliaGenentechE-mail megerősítve itt: gene.com
Ehsan HajiramezanaliPrincipal Research Scientist, GenentechE-mail megerősítve itt: gene.com
Simon Shaolei DuAssistant Professor, School of Computer Science and Engineering, University of WashingtonE-mail megerősítve itt: cs.washington.edu
Yuandong TianResearch Scientist, Meta AI (FAIR)E-mail megerősítve itt: fb.com
Gokcen EraslanPrincipal Scientist @ GenentechE-mail megerősítve itt: gene.com
Shuiwang Ji, Professor and Truchard...Department of Computer Science & Engineering, Texas A&M UniversityE-mail megerősítve itt: tamu.edu
Sun-Yuan KungProfessor of Electrical Engineering, Princeton UniversityE-mail megerősítve itt: princeton.edu
Jianshu ChenPrincipal Scientist, AmazonE-mail megerősítve itt: ucla.edu
Zhuoran YangYale UniversityE-mail megerősítve itt: yale.edu

Követés

Yulai Zhao

Princeton University

E-mail megerősítve itt: princeton.edu - Kezdőlap

Reinforcement Learning ML for Science


Cím Rendezés hivatkozások szerint Rendezés év szerint Rendezés cím szerint	Hivatkozott rá Hivatkozott rá	Év
Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games Y Zhao, Y Tian, J Lee, S Du International Conference on Artificial Intelligence and Statistics (AISTATS …, 2022	72	2022
Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control M Uehara, Y Zhao, K Black, E Hajiramezanali, G Scalia, NL Diamant, ... arXiv preprint arXiv:2402.15194, 2024	40	2024
Feedback Efficient Online Fine-Tuning of Diffusion Models M Uehara, Y Zhao, K Black, E Hajiramezanali, G Scalia, NL Diamant, ... International Conference on Machine Learning (ICML), 48892-48918, 2024	24	2024
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review M Uehara, Y Zhao, T Biancalani, S Levine arXiv preprint arXiv:2407.13734, 2024	23	2024
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models M Uehara, Y Zhao, E Hajiramezanali, G Scalia, G Eraslan, A Lal, ... Conference on Neural Information Processing Systems (NeurIPS), 2024	13	2024
Derivative-free guidance in continuous and discrete diffusion models with soft value-based decoding X Li, Y Zhao, C Wang, G Scalia, G Eraslan, S Nair, T Biancalani, S Ji, ... arXiv preprint arXiv:2408.08252, 2024	12	2024
Optimizing the Performative Risk under Weak Convexity Assumptions Y Zhao OPT 2022: Optimization for Machine Learning (NeurIPS 2022 Workshop), 2022	10	2022
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning Y Zhao, Z Yang, Z Wang, JD Lee International Conference on Machine Learning (ICML), 42200-42226, 2023	6	2023
Adding Conditional Control to Diffusion Models with Reinforcement Learning Y Zhao, M Uehara, G Scalia, S Kung, T Biancalani, S Levine, ... International Conference on Learning Representations (ICLR), 2024	5	2024
Provably Efficient CVaR RL in Low-rank MDPs Y Zhao, W Zhan, X Hu*, H Leung, F Farnia, W Sun, JD Lee International Conference on Learning Representations (ICLR), 2024	3	2024
Blessing of Class Diversity in Pre-training Y Zhao, J Chen, SS Du International Conference on Artificial Intelligence and Statistics (AISTATS …, 2023	3	2023
Inference-Time Alignment in Diffusion Models with Reward-Guided Generation: Tutorial and Review M Uehara, Y Zhao, C Wang, X Li, A Regev, S Levine, T Biancalani arXiv preprint arXiv:2501.09685, 2025	2*	2025
Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design M Uehara, X Su, Y Zhao, X Li, A Regev, S Ji, S Levine, T Biancalani arXiv preprint arXiv:2502.14944, 2025		2025

A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.

Cikkek 1–13

Hivatkozások évente

Ismétlődő hivatkozások

Összevont hivatkozások

Társszerzők hozzáadásaTársszerzők

Követés

Hivatkozott rá

Társszerzők