Follow
Qingpeng Cai
Qingpeng Cai
Kuaishou Technology
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Cited by
Year
A deep reinforcement learning framework for rebalancing dockless bike sharing systems
L Pan, Q Cai, Z Fang, P Tang, L Huang
Proceedings of the AAAI conference on artificial intelligence 33 (01), 1393-1400, 2019
1922019
Softmax deep double deterministic policy gradients
L Pan, Q Cai, L Huang
Advances in neural information processing systems 33, 11767-11777, 2020
1112020
Reinforcement Mechanism Design for e-commerce
Q Cai, A Filos-Ratsikas, P Tang, Y Zhang
Proceedings of the 2018 World Wide Web Conference, 1339-1348, 2018
882018
Reinforcement learning with dynamic boltzmann softmax updates
L Pan, Q Cai, Q Meng, W Chen, L Huang, TY Liu
IJCAI-2020, 2019
502019
Two-stage constrained actor-critic for short video recommendation
Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan, X Wang, T Zuo, W Xie, ...
Proceedings of the ACM Web Conference 2023, 865-875, 2023
47*2023
Policy gradients for contextual recommendations
F Pan, Q Cai, P Tang, F Zhuang, Q He
The World Wide Web Conference, 1421-1431, 2019
472019
Facility location with minimax envy
Q Cai, A Filos-Ratsikas, P Tang
IJCAI 2016, 137-143, 2016
462016
Reinforcement mechanism design for fraudulent behaviour in e-commerce
Q Cai, A Filos-Ratsikas, P Tang, Y Zhang
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
432018
Multi-task recommendations with reinforcement learning
Z Liu, J Tian, Q Cai, X Zhao, J Gao, S Liu, D Chen, T He, D Zheng, P Jiang, ...
Proceedings of the ACM Web Conference 2023, 1273-1282, 2023
412023
Reinforcing user retention in a billion scale short video recommender system
Q Cai, S Liu, X Wang, T Zuo, W Xie, B Yang, D Zheng, P Jiang, K Gai
Companion Proceedings of the ACM Web Conference 2023, 421-426, 2023
412023
A large language model enhanced conversational recommender system
Y Feng, S Liu, Z Xue, Q Cai, L Hu, P Jiang, K Gai, F Sun
arXiv preprint arXiv:2308.06212, 2023
362023
Reinforcement Learning Driven Heuristic Optimization
Q Cai, W Hang, A Mirhoseini, G Tucker, J Wang, W Wei
DRL4KDD-2019, 2019
362019
PrefRec: recommender systems with human preferences for reinforcing long-term user engagement
W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng, P Jiang, K Gai, B An
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023
29*2023
ResAct: Reinforcing long-term engagement in sequential recommendation with residual actor
W Xue, Q Cai, R Zhan, D Zheng, P Jiang, K Gai, B An
ICLR-2023, 2022
292022
Exploration and regularization of the latent action space in recommendation
S Liu, Q Cai, B Sun, Y Wang, J Jiang, D Zheng, P Jiang, K Gai, X Zhao, ...
Proceedings of the ACM Web Conference 2023, 833-844, 2023
272023
KuaiSim: A comprehensive simulator for recommender systems
K Zhao, S Liu, Q Cai, X Zhao, Z Liu, D Zheng, P Jiang, K Gai
Advances in Neural Information Processing Systems 36, 44880-44897, 2023
232023
Generative flow network for listwise recommendation
S Liu, Q Cai, Z He, B Sun, J McAuley, D Zheng, P Jiang, K Gai
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023
162023
Policy optimization with model-based explorations
F Pan, Q Cai, AX Zeng, CX Pan, Q Da, H He, Q He, P Tang
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4675-4682, 2019
162019
Mechanism design for personalized recommender systems
Q Cai, A Filos-Ratsikas, C Liu, P Tang
Proceedings of the 10th ACM Conference on Recommender Systems, 159-166, 2016
162016
M3oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework
Z Zhang, S Liu, J Yu, Q Cai, X Zhao, C Zhang, Z Liu, Q Liu, H Zhao, L Hu, ...
Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024
142024
The system can't perform the operation now. Try again later.
Articles 1–20