Obserwuj
Hui Yuan
Hui Yuan
Zweryfikowany adres z princeton.edu - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
MaxMin-RLHF: Alignment with diverse human preferences
S Chakraborty, J Qiu, H Yuan, A Koppel, F Huang, D Manocha, AS Bedi, ...
arXiv preprint arXiv:2402.08925, 2024
682024
Reward-directed conditional diffusion: Provable distribution estimation and reward improvement
H Yuan, K Huang, C Ni, M Chen, M Wang
Advances in Neural Information Processing Systems 36, 60599-60635, 2023
352023
Gradient guidance for diffusion models: An optimization perspective
Y Guo, H Yuan, Y Yang, M Chen, M Wang
Advances in Neural Information Processing Systems 37, 90736-90770, 2025
172025
Diffusion model for data-driven black-box optimization
Z Li, H Yuan, K Huang, C Ni, Y Ye, M Chen, M Wang
arXiv preprint arXiv:2403.13219, 2024
102024
Learning entangled single-sample distributions via iterative trimming
H Yuan, Y Liang
International Conference on Artificial Intelligence and Statistics, 2666-2676, 2020
102020
Neural network is heterogeneous: Phase matters more
Y Nie, H Yuan
arXiv preprint arXiv:2111.02014, 2021
72021
Learning entangled single-sample Gaussians in the subset-of-signals model
Y Liang, H Yuan
Conference on Learning Theory, 2712-2737, 2020
72020
Bandit theory and thompson sampling-guided directed evolution for sequence optimization
H Yuan, C Ni, H Wang, X Zhang, L Cong, C Szepesvári, M Wang
Advances in Neural Information Processing Systems 35, 38291-38304, 2022
62022
Unified off-policy learning to rank: a reinforcement learning perspective
Z Zhang, Y Su, H Yuan, Y Wu, R Balasubramanian, Q Wu, H Wang, ...
Advances in Neural Information Processing Systems 36, 19887-19907, 2023
52023
Adversarial attacks on online learning to rank with stochastic click models
Z Wang, R Balasubramanian, H Yuan, C Song, M Wang, H Wang
arXiv preprint arXiv:2305.19218, 2023
22023
Uniform joint screening for ultra-high dimensional graphical models
Z Zheng, H Shi, Y Li, H Yuan
Journal of Multivariate Analysis 179, 104645, 2020
22020
Conversational Dueling Bandits in Generalized Linear Models
S Yang, H Yuan, X Zhang, M Wang, H Zhang, H Wang
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and …, 2024
12024
Tree search-based evolutionary bandits for protein sequence optimization
J Qiu, H Yuan, J Zhang, W Chen, H Wang, M Wang
Proceedings of the AAAI Conference on Artificial Intelligence 38 (13), 14686 …, 2024
12024
Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models
Y Guo, Y Yang, H Yuan, M Wang
arXiv preprint arXiv:2502.11420, 2025
2025
A First-order Generative Bilevel Optimization Framework for Diffusion Models
Q Xiao, H Yuan, AFM Saif, G Liu, R Kompella, M Wang, T Chen
arXiv preprint arXiv:2502.08808, 2025
2025
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement
H Yuan, Y Zeng, Y Wu, H Wang, M Wang, L Leqi
arXiv preprint arXiv:2410.13828, 2024
2024
Common Pitfalls of Margin-based Preference Optimization in Language Model Alignment
H Yuan, Y Zeng, Y Wu, H Wang, M Wang, L Leqi
The Thirteenth International Conference on Learning Representations, 0
GuideCO: Training Objective-Guided Diffusion Solver with Imperfect Data for Combinatorial Optimization
H Yuan, Z Hua, Z Li, W Cong, Y Xie, R Jin, S Yang, B Long, M Wang
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–18