Qingpeng Cai

Cited by

	All	Since 2020
Citations	1003	899
h-index	16	16
i10-index	21	20

Citations per year
Year	Citations
2017	6
2018	29
2019	67
2020	84
2021	106
2022	116
2023	216
2024	319
2025	58

Public access

13 articles available, 1 article not available (based on funding mandates)

Co-authors

Peng Jiang	Kuaishou Technology	Verified email at kuaishou.com
Kun Gai	Senior Director & Researcher, Alibaba Group	Verified email at taobao.com
Ling Pan	Assistant Professor, Hong Kong University of Science and Technology	Verified email at ust.hk
Longbo Huang	Professor, IIIS, Tsinghua University, ACM Distinguished Scientist	Verified email at tsinghua.edu.cn
Bo An	Nanyang Technological University	Verified email at ntu.edu.sg
Azalia Mirhoseini	Assistant Professor of Computer Science, Stanford - Google DeepMind	Verified email at stanford.edu
George Tucker	Google DeepMind	Verified email at google.com
Xiangyu Zhao	Assistant Professor, City University of Hong Kong	Verified email at cityu.edu.hk
Julian McAuley	Professor, UC San Diego	Verified email at eng.ucsd.edu

Qingpeng Cai

Kuaishou Technology

Verified email at mails.tsinghua.edu.cn

Reinforcement Learning, Recommender System


Title	Cited by	Year
A deep reinforcement learning framework for rebalancing dockless bike sharing systems. L Pan, Q Cai, Z Fang, P Tang, L Huang. Proceedings of the AAAI conference on artificial intelligence 33 (01), 1393-1400, 2019	192	2019
Softmax deep double deterministic policy gradients. L Pan, Q Cai, L Huang. Advances in neural information processing systems 33, 11767-11777, 2020	111	2020
Reinforcement Mechanism Design for e-commerce. Q Cai, A Filos-Ratsikas, P Tang, Y Zhang. Proceedings of the 2018 World Wide Web Conference, 1339-1348, 2018	88	2018
Reinforcement learning with dynamic boltzmann softmax updates. L Pan, Q Cai, Q Meng, W Chen, L Huang, TY Liu. IJCAI-2020, 2019	50	2019
Two-stage constrained actor-critic for short video recommendation. Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan, X Wang, T Zuo, W Xie, ... Proceedings of the ACM Web Conference 2023, 865-875, 2023	47*	2023
Policy gradients for contextual recommendations. F Pan, Q Cai, P Tang, F Zhuang, Q He. The World Wide Web Conference, 1421-1431, 2019	47	2019
Facility location with minimax envy. Q Cai, A Filos-Ratsikas, P Tang. IJCAI 2016, 137-143, 2016	46	2016
Reinforcement mechanism design for fraudulent behaviour in e-commerce. Q Cai, A Filos-Ratsikas, P Tang, Y Zhang. Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	43	2018
Multi-task recommendations with reinforcement learning. Z Liu, J Tian, Q Cai, X Zhao, J Gao, S Liu, D Chen, T He, D Zheng, P Jiang, ... Proceedings of the ACM Web Conference 2023, 1273-1282, 2023	41	2023
Reinforcing user retention in a billion scale short video recommender system. Q Cai, S Liu, X Wang, T Zuo, W Xie, B Yang, D Zheng, P Jiang, K Gai. Companion Proceedings of the ACM Web Conference 2023, 421-426, 2023	41	2023
A large language model enhanced conversational recommender system. Y Feng, S Liu, Z Xue, Q Cai, L Hu, P Jiang, K Gai, F Sun. arXiv preprint arXiv:2308.06212, 2023	36	2023
Reinforcement Learning Driven Heuristic Optimization. Q Cai, W Hang, A Mirhoseini, G Tucker, J Wang, W Wei. DRL4KDD-2019, 2019	36	2019
PrefRec: recommender systems with human preferences for reinforcing long-term user engagement. W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng, P Jiang, K Gai, B An. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023	29*	2023
ResAct: Reinforcing long-term engagement in sequential recommendation with residual actor. W Xue, Q Cai, R Zhan, D Zheng, P Jiang, K Gai, B An. ICLR-2023, 2022	29	2022
Exploration and regularization of the latent action space in recommendation. S Liu, Q Cai, B Sun, Y Wang, J Jiang, D Zheng, P Jiang, K Gai, X Zhao, ... Proceedings of the ACM Web Conference 2023, 833-844, 2023	27	2023
KuaiSim: A comprehensive simulator for recommender systems. K Zhao, S Liu, Q Cai, X Zhao, Z Liu, D Zheng, P Jiang, K Gai. Advances in Neural Information Processing Systems 36, 44880-44897, 2023	23	2023
Generative flow network for listwise recommendation. S Liu, Q Cai, Z He, B Sun, J McAuley, D Zheng, P Jiang, K Gai. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023	16	2023
Policy optimization with model-based explorations. F Pan, Q Cai, AX Zeng, CX Pan, Q Da, H He, Q He, P Tang. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4675-4682, 2019	16	2019
Mechanism design for personalized recommender systems. Q Cai, A Filos-Ratsikas, C Liu, P Tang. Proceedings of the 10th ACM Conference on Recommender Systems, 159-166, 2016	16	2016
M³oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework. Z Zhang, S Liu, J Yu, Q Cai, X Zhao, C Zhang, Z Liu, Q Liu, H Zhao, L Hu, ... Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024	14	2024
