KuaiSim: A comprehensive simulator for recommender systems
Reinforcement Learning (RL)-based recommender systems (RSs) have garnered
considerable attention due to their ability to learn optimal recommendation policies and …
considerable attention due to their ability to learn optimal recommendation policies and …
Reinforcing user retention in a billion scale short video recommender system
Recently, short video platforms have achieved rapid user growth by recommending
interesting content to users. The objective of the recommendation is to optimize user …
interesting content to users. The objective of the recommendation is to optimize user …
PrefRec: recommender systems with human preferences for reinforcing long-term user engagement
Current advances in recommender systems have been remarkably successful in optimizing
immediate engagement. However, long-term user engagement, a more desirable …
immediate engagement. However, long-term user engagement, a more desirable …
Meta clustering of neural bandits
The contextual bandit has been identified as a powerful framework to formulate the
recommendation process as a sequential decision-making process, where each item is …
recommendation process as a sequential decision-making process, where each item is …
Future Impact Decomposition in Request-level Recommendations
In recommender systems, reinforcement learning solutions have shown promising results in
optimizing the interaction sequence between users and the system over the long-term …
optimizing the interaction sequence between users and the system over the long-term …
Interpretable User Retention Modeling in Recommendation
Recommendation usually focuses on immediate accuracy metrics like CTR as training
objectives. User retention rate, which reflects the percentage of today's users that will return …
objectives. User retention rate, which reflects the percentage of today's users that will return …
UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution
In recent years, there has been a growing interest in utilizing reinforcement learning (RL) to
optimize long-term rewards in recommender systems. Since industrial recommender …
optimize long-term rewards in recommender systems. Since industrial recommender …
AdaRec: Adaptive sequential recommendation for reinforcing long-term user engagement
Growing attention has been paid to Reinforcement Learning (RL) algorithms when
optimizing long-term user engagement in sequential recommendation tasks. One challenge …
optimizing long-term user engagement in sequential recommendation tasks. One challenge …
[PDF][PDF] PrefRec: Preference-based recommender systems for reinforcing long-term user engagement
Current advances in recommender systems have been remarkably successful in optimizing
immediate engagement. However, longterm user engagement, a more desirable …
immediate engagement. However, longterm user engagement, a more desirable …
LabelCraft: Empowering Short Video Recommendations with Automated Label Crafting
Short video recommendations often face limitations due to the quality of user feedback,
which may not accurately depict user interests. To tackle this challenge, a new task has …
which may not accurately depict user interests. To tackle this challenge, a new task has …