Folgen
Yue Wu
Yue Wu
Postdoctoral Research Fellow, Princeton University
Bestätigte E-Mail-Adresse bei ucla.edu - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Towards understanding the spectral bias of deep learning
Y Cao, Z Fang, Y Wu, DX Zhou, Q Gu
arXiv preprint arXiv:1912.01198, 2019
2462019
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Y Wu, W Zhang, P Xu, Q Gu
NeurIPS 2020, 2020
1652020
Towards understanding learning representations: To what extent do different neural networks learn the same representation
L Wang, L Hu, J Gu, Z Hu, Y Wu, K He, J Hopcroft
Advances in neural information processing systems 31, 2018
1272018
DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text
X Yang, W Cheng, Y Wu, L Petzold, WY Wang, H Chen
arXiv preprint arXiv:2305.17359, 2023
842023
Towards understanding the mixture-of-experts layer in deep learning
Z Chen, Y Deng, Y Wu, Q Gu, Y Li
Advances in neural information processing systems 35, 23049-23062, 2022
702022
Self-play preference optimization for language model alignment
Y Wu, Z Sun, H Yuan, K Ji, Y Yang, Q Gu
arXiv preprint arXiv:2405.00675, 2024
662024
Personalized Federated Learning under Mixture of Distributions
Y Wu, S Zhang, W Yu, Y Liu, Q Gu, D Zhou, H Chen, W Cheng
Fortieth International Conference on Machine Learning (ICML 2023), 2023
442023
Nearly minimax optimal regret for learning infinite-horizon average-reward mdps with linear function approximation
Y Wu, D Zhou, Q Gu
International Conference on Artificial Intelligence and Statistics, 3883-3913, 2022
242022
Protein conformation generation via force-guided se (3) diffusion models
Y Wang, L Wang, Y Shen, Y Wang, H Yuan, Y Wu, Q Gu
arXiv preprint arXiv:2403.14088, 2024
122024
Variance-aware regret bounds for stochastic contextual dueling bandits
Q Di, T Jin, Y Wu, H Zhao, F Farnoud, Q Gu
arXiv preprint arXiv:2310.00968, 2023
112023
Borda regret minimization for generalized linear dueling bandits
Y Wu, T Jin, H Lou, F Farnoud, Q Gu
arXiv preprint arXiv:2303.08816, 2023
92023
Active ranking without strong stochastic transitivity
H Lou, T Jin, Y Wu, P Xu, Q Gu, F Farnoud
Advances in neural information processing systems 35, 297-309, 2022
82022
Adaptive sampling for heterogeneous rank aggregation from noisy pairwise comparisons
Y Wu, T Jin, H Lou, P Xu, F Farnoud, Q Gu
International Conference on Artificial Intelligence and Statistics, 11014-11036, 2022
62022
Treebon: Enhancing inference-time alignment with speculative tree-search and best-of-n sampling
J Qiu, Y Lu, Y Zeng, J Guo, J Geng, H Wang, K Huang, Y Wu, M Wang
arXiv preprint arXiv:2410.16033, 2024
52024
Uniform-PAC guarantees for model-based RL with bounded eluder dimension
Y Wu, J He, Q Gu
Uncertainty in Artificial Intelligence, 2304-2313, 2023
42023
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement
H Yuan, Y Zeng, Y Wu, H Wang, M Wang, L Leqi
arXiv preprint arXiv:2410.13828, 2024
2024
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–16