Підписатись
Yingru Li
Yingru Li
Інші іменаYing-Ru Li, Y. Li
The Chinese University of Hong Kong, Shenzhen, China
Підтверджена електронна адреса в link.cuhk.edu.cn - Домашня сторінка
Назва
Посилання
Посилання
Рік
Hidden community detection in social networks
K He, Y Li, S Soundarajan, JE Hopcroft
Information Sciences 425, 92-106, 2018
1182018
Hyperdqn: A randomized exploration method for deep reinforcement learning
Z Li, Y Li, Y Zhang, T Zhang, ZQ Luo
International Conference on Learning Representations, 2021
202021
Divergence-augmented policy optimization
Q Wang, Y Li, J Xiong, T Zhang
Advances in Neural Information Processing Systems 32, 2019
152019
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Y Li, J Xu, L Han, ZQ Luo
Forty-first International Conference on Machine Learning, 2024
12*2024
Scaling Flaws of Verifier-Guided Search in Mathematical Reasoning
F Yu, Y Li, B Wang
arXiv preprint arXiv:2502.00271, 2025
12025
Simple, unified analysis of Johnson-Lindenstrauss with applications
Y Li
arXiv preprint arXiv:2402.10232, 2024
12024
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games
Y Li, L Liu, W Pu, H Liang, ZQ Luo
arXiv preprint arXiv:2402.09456, 2024
12024
Scalable Exploration via Ensemble++
Y Li, J Xu, B Wang, ZQ Luo
12024
Uncertainty-Aware Search and Value Models: Mitigating Search Scaling Flaws in LLMs
F Yu, Y Li, B Wang
arXiv preprint arXiv:2502.11155, 2025
2025
Radar Anti-Jamming Strategy Learning via Domain-Knowledge Enhanced Online Convex Optimization
L Liu, W Pu, Y Li, B Jiu, ZQ Luo
2024 IEEE 13rd Sensor Array and Multichannel Signal Processing Workshop (SAM …, 2024
2024
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation
Y Li, Z Luo
International Conference on Artificial Intelligence and Statistics, 559-567, 2024
2024
Probability Tools for Sequential Random Projection
Y Li
arXiv preprint arXiv:2402.14026, 2024
2024
GPT-HyperAgent: Scalable Uncertainty Estimation and Exploration for Foundation Model Decisions
Y Li, J Xu, ZQ Luo
Automated Reinforcement Learning: Exploring Meta-Learning, AutoML, and LLMs, 0
У даний момент система не може виконати операцію. Спробуйте пізніше.
Статті 1–13