Federated linear contextual bandits R Huang, W Wu, J Yang, C Shen Advances in Neural Information Processing Systems (NeurIPS) 34, 27057-27068, 2021 | 85 | 2021 |
Federated Linear Contextual Bandits with User-level Differential Privacy R Huang, H Zhang, L Melis, M Shen, M Hejazinia, J Yang International Conference on Machine Learning (ICML), 2023 | 16 | 2023 |
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs Y Cheng, R Huang, Y Liang, J Yang International Conference on Learning Representations (ICLR), 2023 | 11 | 2023 |
Temporal-distributed backdoor attack against video based action recognition X Li, S Wang, R Huang, M Gowda, G Kesidis Proceedings of the AAAI Conference on Artificial Intelligence 38 (4), 3199-3207, 2024 | 6 | 2024 |
Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-free RL R Huang, J Yang, Y Liang International Conference on Learning Representations (ICLR), 2022 | 6 | 2022 |
Provably efficient ucb-type algorithms for learning predictive state representations R Huang, Y Liang, J Yang arXiv preprint arXiv:2307.00405, 2023 | 5 | 2023 |
Non-stationary Reinforcement Learning under General Function Approximation S Feng, M Yin, R Huang, YX Wang, J Yang, Y Liang International Conference on Machine Learning (ICML), 2023 | 5 | 2023 |
Differentially Private Wireless Federated Learning Using Orthogonal Sequences X Wei, T Wang, R Huang, C Shen, J Yang, HV Poor arXiv preprint arXiv:2306.08280, 2023 | 4 | 2023 |
Non-asymptotic convergence of training transformers for next-token prediction R Huang, Y Liang, J Yang arXiv preprint arXiv:2409.17335, 2024 | 3 | 2024 |
FLORAS: Differentially private wireless federated learning using orthogonal sequences X Wei, T Wang, R Huang, C Shen, J Yang, HV Poor ICC 2023-IEEE International Conference on Communications, 3121-3126, 2023 | 3 | 2023 |
Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints D Li, R Huang, C Shen, J Yang International Conference on Machine Learning (ICML), 2023 | 3 | 2023 |
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes R Huang, Y Cheng, J Yang, V Tan, Y Liang arXiv preprint arXiv:2310.13550, 2023 | 1 | 2023 |
Cascading Bandits with Two-Level Feedback D Cheng, R Huang, C Shen, J Yang 2022 IEEE International Symposium on Information Theory (ISIT), 2022 | 1 | 2022 |
Robust Offline Reinforcement Learning for Non-Markovian Decision Processes R Huang, Y Liang, J Yang arXiv preprint arXiv:2411.07514, 2024 | | 2024 |
Federated Online Prediction from Experts with Differential Privacy: Separations and Regret Speed-ups F Gao, R Huang, J Yang arXiv preprint arXiv:2409.19092, 2024 | | 2024 |
Towards General Function Approximation in Nonstationary Reinforcement Learning S Feng, M Yin, R Huang, YX Wang, J Yang, Y Liang IEEE Journal on Selected Areas in Information Theory, 2024 | | 2024 |