Self-play fine-tuning converts weak language models to strong language models Z Chen, Y Deng, H Yuan, K Ji, Q Gu arXiv preprint arXiv:2401.01335, 2024 | 186 | 2024 |
Self-play preference optimization for language model alignment Y Wu, Z Sun, H Yuan, K Ji, Y Yang, Q Gu arXiv preprint arXiv:2405.00675, 2024 | 65 | 2024 |
SIPID: A deep learning framework for sinogram interpolation and image denoising in low-dose CT reconstruction H Yuan, J Jia, Z Zhu 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018 …, 2018 | 51 | 2018 |
A general framework for sample-efficient function approximation in reinforcement learning Z Chen, CJ Li, A Yuan, Q Gu, MI Jordan arXiv preprint arXiv:2209.15634, 2022 | 38 | 2022 |
Stochastic recursive momentum for policy gradient methods H Yuan, X Lian, J Liu, Y Zhou arXiv preprint arXiv:2003.04302, 2020 | 34 | 2020 |
Efficient smooth non-convex stochastic compositional optimization via stochastic recursive gradient descent W Hu, CJ Li, X Lian, J Liu, H Yuan Advances in Neural Information Processing Systems 32, 2019 | 34 | 2019 |
Stochastic recursive momentum method for non-convex compositional optimization H Yuan, W Hu arXiv preprint arXiv:2006.01688, 2020 | 14 | 2020 |
Protein conformation generation via force-guided se (3) diffusion models Y Wang, L Wang, Y Shen, Y Wang, H Yuan, Y Wu, Q Gu arXiv preprint arXiv:2403.14088, 2024 | 12 | 2024 |
Stochastic recursive variance reduction for efficient smooth non-convex compositional optimization H Yuan, X Lian, J Liu arXiv preprint arXiv:1912.13515, 2019 | 12 | 2019 |
Differential inclusions for modeling nonsmooth ADMM variants: A continuous limit theory H Yuan, Y Zhou, CJ Li, Q Sun International Conference on Machine Learning, 7232-7241, 2019 | 10 | 2019 |
Self-play fine-tuning of diffusion models for text-to-image generation H Yuan, Z Chen, K Ji, Q Gu arXiv preprint arXiv:2402.10210, 2024 | 9 | 2024 |
Stochastic modified equations for continuous limit of stochastic ADMM X Zhou, H Yuan, CJ Li, Q Sun arXiv preprint arXiv:2003.03532, 2020 | 9 | 2020 |
Nesterov meets optimism: rate-optimal separable minimax optimization CJ Li, H Yuan, G Gidel, Q Gu, M Jordan International Conference on Machine Learning, 20351-20383, 2023 | 7 | 2023 |
Self-play fine-tuning converts weak language models to strong language models, 2024 Z Chen, Y Deng, H Yuan, K Ji, Q Gu URL https://arxiv. org/abs/2401.01335, 0 | 7 | |
Object-oriented state abstraction in reinforcement learning for video games Y Chen, H Yuan, Y Li 2019 IEEE Conference on Games (CoG), 1-4, 2019 | 6 | 2019 |
Policy optimization via stochastic recursive gradient algorithm H Yuan, CJ Li, Y Tang, Y Zhou | 6 | 2019 |
Fast Sampling via De-randomization for Discrete Diffusion Models Z Chen, H Yuan, Y Li, Y Kou, J Zhang, Q Gu arXiv preprint arXiv:2312.09193, 2023 | 5 | 2023 |
Tensor Product Attention Is All You Need Y Zhang, Y Liu, H Yuan, Z Qin, Y Yuan, Q Gu, ACC Yao arXiv preprint arXiv:2501.06425, 2025 | 1 | 2025 |
Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time Z Chen, H Yuan, Y Li, Y Kou, J Zhang, Q Gu The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024 | 1 | 2024 |
Mars: Unleashing the power of variance reduction for training large models H Yuan, Y Liu, S Wu, X Zhou, Q Gu arXiv preprint arXiv:2411.10438, 2024 | 1 | 2024 |