[HTML][HTML] Deep reinforcement learning in recommender systems: A survey and new perspectives

X Chen, L Yao, J McAuley, G Zhou, X Wang - Knowledge-Based Systems, 2023 - Elsevier
In light of the emergence of deep reinforcement learning (DRL) in recommender systems
research and several fruitful results in recent years, this survey aims to provide a timely and …

Alleviating matthew effect of offline reinforcement learning in interactive recommendation

C Gao, K Huang, J Chen, Y Zhang, B Li… - Proceedings of the 46th …, 2023 - dl.acm.org
Offline reinforcement learning (RL), a technology that offline learns a policy from logged data
without the need to interact with online environments, has become a favorable choice in …

Large language models are learnable planners for long-term recommendation

W Shi, X He, Y Zhang, C Gao, X Li, J Zhang… - Proceedings of the 47th …, 2024 - dl.acm.org
Planning for both immediate and long-term benefits becomes increasingly important in
recommendation. Existing methods apply Reinforcement Learning (RL) to learn planning …

[PDF][PDF] A comprehensive survey of small language models in the era of large language models: Techniques, enhancements, applications, collaboration with llms, and …

F Wang, Z Zhang, X Zhang, Z Wu, T Mo, Q Lu… - arxiv preprint arxiv …, 2024 - ai.radensa.ru
Large language models (LLM) have demonstrated emergent abilities in text generation,
question answering, and reasoning, facilitating various tasks and domains. Despite their …

Two-stage constrained actor-critic for short video recommendation

Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan… - Proceedings of the …, 2023 - dl.acm.org
The wide popularity of short videos on social media poses new opportunities and
challenges to optimize recommender systems on the video-sharing platforms. Users …

Sequential recommendation for optimizing both immediate feedback and long-term retention

Z Liu, S Liu, Z Zhang, Q Cai, X Zhao, K Zhao… - Proceedings of the 47th …, 2024 - dl.acm.org
In Recommender System (RS) applications, reinforcement learning (RL) has recently
emerged as a powerful tool, primarily due to its proficiency in optimizing long-term rewards …

Reinforcing user retention in a billion scale short video recommender system

Q Cai, S Liu, X Wang, T Zuo, W **e, B Yang… - … Proceedings of the …, 2023 - dl.acm.org
Recently, short video platforms have achieved rapid user growth by recommending
interesting content to users. The objective of the recommendation is to optimize user …

Multi-task recommendations with reinforcement learning

Z Liu, J Tian, Q Cai, X Zhao, J Gao, S Liu… - Proceedings of the …, 2023 - dl.acm.org
In recent years, Multi-task Learning (MTL) has yielded immense success in Recommender
System (RS) applications [40]. However, current MTL-based recommendation models tend …

Transact: Transformer-based realtime user action model for recommendation at pinterest

X **a, P Eksombatchai, N Pancha, DD Badani… - Proceedings of the 29th …, 2023 - dl.acm.org
Sequential models that encode user activity for next action prediction have become a
popular design choice for building web-scale personalized recommendation systems …

A survey on multi-objective recommender systems

D Jannach, H Abdollahpouri - Frontiers in big Data, 2023 - frontiersin.org
Recommender systems can be characterized as software solutions that provide users with
convenient access to relevant content. Traditionally, recommender systems research …