Title | Authors | Venue | Cited by | Year |
A deep reinforcement learning framework for rebalancing dockless bike sharing systems | L Pan, Q Cai, Z Fang, P Tang, L Huang | Proceedings of the AAAI conference on artificial intelligence 33 (01), 1393-1400, 2019 | 192 | 2019 |
Softmax deep double deterministic policy gradients | L Pan, Q Cai, L Huang | Advances in neural information processing systems 33, 11767-11777, 2020 | 111 | 2020 |
Reinforcement Mechanism Design for e-commerce | Q Cai, A Filos-Ratsikas, P Tang, Y Zhang | Proceedings of the 2018 World Wide Web Conference, 1339-1348, 2018 | 88 | 2018 |
Reinforcement learning with dynamic Boltzmann softmax updates | L Pan, Q Cai, Q Meng, W Chen, L Huang, TY Liu | IJCAI-2020, 2019 | 50 | 2019 |
Two-stage constrained actor-critic for short video recommendation | Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan, X Wang, T Zuo, W Xie, ... | Proceedings of the ACM Web Conference 2023, 865-875, 2023 | 47* | 2023 |
Policy gradients for contextual recommendations | F Pan, Q Cai, P Tang, F Zhuang, Q He | The World Wide Web Conference, 1421-1431, 2019 | 47 | 2019 |
Facility location with minimax envy | Q Cai, A Filos-Ratsikas, P Tang | IJCAI 2016, 137-143, 2016 | 46 | 2016 |
Reinforcement mechanism design for fraudulent behaviour in e-commerce | Q Cai, A Filos-Ratsikas, P Tang, Y Zhang | Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 43 | 2018 |
Multi-task recommendations with reinforcement learning | Z Liu, J Tian, Q Cai, X Zhao, J Gao, S Liu, D Chen, T He, D Zheng, P Jiang, ... | Proceedings of the ACM Web Conference 2023, 1273-1282, 2023 | 41 | 2023 |
Reinforcing user retention in a billion scale short video recommender system | Q Cai, S Liu, X Wang, T Zuo, W Xie, B Yang, D Zheng, P Jiang, K Gai | Companion Proceedings of the ACM Web Conference 2023, 421-426, 2023 | 41 | 2023 |
A large language model enhanced conversational recommender system | Y Feng, S Liu, Z Xue, Q Cai, L Hu, P Jiang, K Gai, F Sun | arXiv preprint arXiv:2308.06212, 2023 | 36 | 2023 |
Reinforcement Learning Driven Heuristic Optimization | Q Cai, W Hang, A Mirhoseini, G Tucker, J Wang, W Wei | DRL4KDD-2019, 2019 | 36 | 2019 |
PrefRec: recommender systems with human preferences for reinforcing long-term user engagement | W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng, P Jiang, K Gai, B An | Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 29* | 2023 |
ResAct: Reinforcing long-term engagement in sequential recommendation with residual actor | W Xue, Q Cai, R Zhan, D Zheng, P Jiang, K Gai, B An | ICLR-2023, 2022 | 29 | 2022 |
Exploration and regularization of the latent action space in recommendation | S Liu, Q Cai, B Sun, Y Wang, J Jiang, D Zheng, P Jiang, K Gai, X Zhao, ... | Proceedings of the ACM Web Conference 2023, 833-844, 2023 | 27 | 2023 |
KuaiSim: A comprehensive simulator for recommender systems | K Zhao, S Liu, Q Cai, X Zhao, Z Liu, D Zheng, P Jiang, K Gai | Advances in Neural Information Processing Systems 36, 44880-44897, 2023 | 23 | 2023 |
Generative flow network for listwise recommendation | S Liu, Q Cai, Z He, B Sun, J McAuley, D Zheng, P Jiang, K Gai | Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 16 | 2023 |
Policy optimization with model-based explorations | F Pan, Q Cai, AX Zeng, CX Pan, Q Da, H He, Q He, P Tang | Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4675-4682, 2019 | 16 | 2019 |
Mechanism design for personalized recommender systems | Q Cai, A Filos-Ratsikas, C Liu, P Tang | Proceedings of the 10th ACM Conference on Recommender Systems, 159-166, 2016 | 16 | 2016 |
M3oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework | Z Zhang, S Liu, J Yu, Q Cai, X Zhao, C Zhang, Z Liu, Q Liu, H Zhao, L Hu, ... | Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024 | 14 | 2024 |