[HTML][HTML] Deep reinforcement learning in recommender systems: A survey and new perspectives
In light of the emergence of deep reinforcement learning (DRL) in recommender systems
research and several fruitful results in recent years, this survey aims to provide a timely and …
research and several fruitful results in recent years, this survey aims to provide a timely and …
Alleviating matthew effect of offline reinforcement learning in interactive recommendation
Offline reinforcement learning (RL), a technology that offline learns a policy from logged data
without the need to interact with online environments, has become a favorable choice in …
without the need to interact with online environments, has become a favorable choice in …
Large language models are learnable planners for long-term recommendation
Planning for both immediate and long-term benefits becomes increasingly important in
recommendation. Existing methods apply Reinforcement Learning (RL) to learn planning …
recommendation. Existing methods apply Reinforcement Learning (RL) to learn planning …
[PDF][PDF] A comprehensive survey of small language models in the era of large language models: Techniques, enhancements, applications, collaboration with llms, and …
Large language models (LLM) have demonstrated emergent abilities in text generation,
question answering, and reasoning, facilitating various tasks and domains. Despite their …
question answering, and reasoning, facilitating various tasks and domains. Despite their …
Two-stage constrained actor-critic for short video recommendation
The wide popularity of short videos on social media poses new opportunities and
challenges to optimize recommender systems on the video-sharing platforms. Users …
challenges to optimize recommender systems on the video-sharing platforms. Users …
Sequential recommendation for optimizing both immediate feedback and long-term retention
In Recommender System (RS) applications, reinforcement learning (RL) has recently
emerged as a powerful tool, primarily due to its proficiency in optimizing long-term rewards …
emerged as a powerful tool, primarily due to its proficiency in optimizing long-term rewards …
Reinforcing user retention in a billion scale short video recommender system
Recently, short video platforms have achieved rapid user growth by recommending
interesting content to users. The objective of the recommendation is to optimize user …
interesting content to users. The objective of the recommendation is to optimize user …
Multi-task recommendations with reinforcement learning
In recent years, Multi-task Learning (MTL) has yielded immense success in Recommender
System (RS) applications [40]. However, current MTL-based recommendation models tend …
System (RS) applications [40]. However, current MTL-based recommendation models tend …
Transact: Transformer-based realtime user action model for recommendation at pinterest
Sequential models that encode user activity for next action prediction have become a
popular design choice for building web-scale personalized recommendation systems …
popular design choice for building web-scale personalized recommendation systems …
A survey on multi-objective recommender systems
Recommender systems can be characterized as software solutions that provide users with
convenient access to relevant content. Traditionally, recommender systems research …
convenient access to relevant content. Traditionally, recommender systems research …