KuaiSim: A comprehensive simulator for recommender systems

K Zhao, S Liu, Q Cai, X Zhao, Z Liu… - Advances in …, 2023 - proceedings.neurips.cc
Reinforcement Learning (RL)-based recommender systems (RSs) have garnered
considerable attention due to their ability to learn optimal recommendation policies and …

Reinforcing user retention in a billion scale short video recommender system

Q Cai, S Liu, X Wang, T Zuo, W **e, B Yang… - … Proceedings of the …, 2023 - dl.acm.org
Recently, short video platforms have achieved rapid user growth by recommending
interesting content to users. The objective of the recommendation is to optimize user …

PrefRec: recommender systems with human preferences for reinforcing long-term user engagement

W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng… - Proceedings of the 29th …, 2023 - dl.acm.org
Current advances in recommender systems have been remarkably successful in optimizing
immediate engagement. However, long-term user engagement, a more desirable …

Meta clustering of neural bandits

Y Ban, Y Qi, T Wei, L Liu, J He - Proceedings of the 30th ACM SIGKDD …, 2024 - dl.acm.org
The contextual bandit has been identified as a powerful framework to formulate the
recommendation process as a sequential decision-making process, where each item is …

Future Impact Decomposition in Request-level Recommendations

X Wang, S Liu, X Wang, Q Cai, L Hu, H Li… - Proceedings of the 30th …, 2024 - dl.acm.org
In recommender systems, reinforcement learning solutions have shown promising results in
optimizing the interaction sequence between users and the system over the long-term …

Interpretable User Retention Modeling in Recommendation

R Ding, R **e, X Hao, X Yang, K Ge, X Zhang… - Proceedings of the 17th …, 2023 - dl.acm.org
Recommendation usually focuses on immediate accuracy metrics like CTR as training
objectives. User retention rate, which reflects the percentage of today's users that will return …

UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution

G Zhang, Y Wang, X Chen, H Qian, K Zhan… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
In recent years, there has been a growing interest in utilizing reinforcement learning (RL) to
optimize long-term rewards in recommender systems. Since industrial recommender …

AdaRec: Adaptive sequential recommendation for reinforcing long-term user engagement

Z Xue, Q Cai, T Zuo, B Yang, L Hu, P Jiang… - arxiv preprint arxiv …, 2023 - arxiv.org
Growing attention has been paid to Reinforcement Learning (RL) algorithms when
optimizing long-term user engagement in sequential recommendation tasks. One challenge …

[PDF][PDF] PrefRec: Preference-based recommender systems for reinforcing long-term user engagement

W Xue, Q Cai, Z Xue, S Sun, S Liu… - arxiv preprint arxiv …, 2022 - researchgate.net
Current advances in recommender systems have been remarkably successful in optimizing
immediate engagement. However, longterm user engagement, a more desirable …

LabelCraft: Empowering Short Video Recommendations with Automated Label Crafting

Y Bai, Y Zhang, J Lu, J Chang, X Zang, Y Niu… - Proceedings of the 17th …, 2024 - dl.acm.org
Short video recommendations often face limitations due to the quality of user feedback,
which may not accurately depict user interests. To tackle this challenge, a new task has …