Segui
Huizhuo Yuan
Huizhuo Yuan
Email verificata su ucla.edu
Titolo
Citata da
Citata da
Anno
Self-play fine-tuning converts weak language models to strong language models
Z Chen, Y Deng, H Yuan, K Ji, Q Gu
arXiv preprint arXiv:2401.01335, 2024
1862024
Self-play preference optimization for language model alignment
Y Wu, Z Sun, H Yuan, K Ji, Y Yang, Q Gu
arXiv preprint arXiv:2405.00675, 2024
652024
SIPID: A deep learning framework for sinogram interpolation and image denoising in low-dose CT reconstruction
H Yuan, J Jia, Z Zhu
2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018 …, 2018
512018
A general framework for sample-efficient function approximation in reinforcement learning
Z Chen, CJ Li, A Yuan, Q Gu, MI Jordan
arXiv preprint arXiv:2209.15634, 2022
382022
Stochastic recursive momentum for policy gradient methods
H Yuan, X Lian, J Liu, Y Zhou
arXiv preprint arXiv:2003.04302, 2020
342020
Efficient smooth non-convex stochastic compositional optimization via stochastic recursive gradient descent
W Hu, CJ Li, X Lian, J Liu, H Yuan
Advances in Neural Information Processing Systems 32, 2019
342019
Stochastic recursive momentum method for non-convex compositional optimization
H Yuan, W Hu
arXiv preprint arXiv:2006.01688, 2020
142020
Protein conformation generation via force-guided se (3) diffusion models
Y Wang, L Wang, Y Shen, Y Wang, H Yuan, Y Wu, Q Gu
arXiv preprint arXiv:2403.14088, 2024
122024
Stochastic recursive variance reduction for efficient smooth non-convex compositional optimization
H Yuan, X Lian, J Liu
arXiv preprint arXiv:1912.13515, 2019
122019
Differential inclusions for modeling nonsmooth ADMM variants: A continuous limit theory
H Yuan, Y Zhou, CJ Li, Q Sun
International Conference on Machine Learning, 7232-7241, 2019
102019
Self-play fine-tuning of diffusion models for text-to-image generation
H Yuan, Z Chen, K Ji, Q Gu
arXiv preprint arXiv:2402.10210, 2024
92024
Stochastic modified equations for continuous limit of stochastic ADMM
X Zhou, H Yuan, CJ Li, Q Sun
arXiv preprint arXiv:2003.03532, 2020
92020
Nesterov meets optimism: rate-optimal separable minimax optimization
CJ Li, H Yuan, G Gidel, Q Gu, M Jordan
International Conference on Machine Learning, 20351-20383, 2023
72023
Self-play fine-tuning converts weak language models to strong language models, 2024
Z Chen, Y Deng, H Yuan, K Ji, Q Gu
URL https://arxiv. org/abs/2401.01335, 0
7
Object-oriented state abstraction in reinforcement learning for video games
Y Chen, H Yuan, Y Li
2019 IEEE Conference on Games (CoG), 1-4, 2019
62019
Policy optimization via stochastic recursive gradient algorithm
H Yuan, CJ Li, Y Tang, Y Zhou
62019
Fast Sampling via De-randomization for Discrete Diffusion Models
Z Chen, H Yuan, Y Li, Y Kou, J Zhang, Q Gu
arXiv preprint arXiv:2312.09193, 2023
52023
Tensor Product Attention Is All You Need
Y Zhang, Y Liu, H Yuan, Z Qin, Y Yuan, Q Gu, ACC Yao
arXiv preprint arXiv:2501.06425, 2025
12025
Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time
Z Chen, H Yuan, Y Li, Y Kou, J Zhang, Q Gu
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
12024
Mars: Unleashing the power of variance reduction for training large models
H Yuan, Y Liu, S Wu, X Zhou, Q Gu
arXiv preprint arXiv:2411.10438, 2024
12024
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20