Hidden community detection in social networks K He, Y Li, S Soundarajan, JE Hopcroft Information Sciences 425, 92-106, 2018 | 118 | 2018 |
Hyperdqn: A randomized exploration method for deep reinforcement learning Z Li, Y Li, Y Zhang, T Zhang, ZQ Luo International Conference on Learning Representations, 2021 | 20 | 2021 |
Divergence-augmented policy optimization Q Wang, Y Li, J Xiong, T Zhang Advances in Neural Information Processing Systems 32, 2019 | 15 | 2019 |
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent Y Li, J Xu, L Han, ZQ Luo Forty-first International Conference on Machine Learning, 2024 | 12* | 2024 |
Scaling Flaws of Verifier-Guided Search in Mathematical Reasoning F Yu, Y Li, B Wang arXiv preprint arXiv:2502.00271, 2025 | 1 | 2025 |
Simple, unified analysis of Johnson-Lindenstrauss with applications Y Li arXiv preprint arXiv:2402.10232, 2024 | 1 | 2024 |
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games Y Li, L Liu, W Pu, H Liang, ZQ Luo arXiv preprint arXiv:2402.09456, 2024 | 1 | 2024 |
Scalable Exploration via Ensemble++ Y Li, J Xu, B Wang, ZQ Luo | 1 | 2024 |
Uncertainty-Aware Search and Value Models: Mitigating Search Scaling Flaws in LLMs F Yu, Y Li, B Wang arXiv preprint arXiv:2502.11155, 2025 | | 2025 |
Radar Anti-Jamming Strategy Learning via Domain-Knowledge Enhanced Online Convex Optimization L Liu, W Pu, Y Li, B Jiu, ZQ Luo 2024 IEEE 13rd Sensor Array and Multichannel Signal Processing Workshop (SAM …, 2024 | | 2024 |
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation Y Li, Z Luo International Conference on Artificial Intelligence and Statistics, 559-567, 2024 | | 2024 |
Probability Tools for Sequential Random Projection Y Li arXiv preprint arXiv:2402.14026, 2024 | | 2024 |
GPT-HyperAgent: Scalable Uncertainty Estimation and Exploration for Foundation Model Decisions Y Li, J Xu, ZQ Luo Automated Reinforcement Learning: Exploring Meta-Learning, AutoML, and LLMs, 0 | | |