DRN: A deep reinforcement learning framework for news recommendation

G Zheng, F Zhang, Z Zheng, Y **ang, NJ Yuan… - Proceedings of the …, 2018‏ - dl.acm.org
In this paper, we propose a novel Deep Reinforcement Learning framework for news
recommendation. Online personalized news recommendation is a highly challenging …

Estimation-action-reflection: Towards deep interaction between conversational and recommender systems

W Lei, X He, Y Miao, Q Wu, R Hong, MY Kan… - Proceedings of the 13th …, 2020‏ - dl.acm.org
Recommender systems are embracing conversational technologies to obtain user
preferences dynamically, and to overcome inherent limitations of their static models. A …

Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

N Silva, H Werneck, T Silva, ACM Pereira… - Expert Systems with …, 2022‏ - Elsevier
Abstract Recommender Systems (RSs) have assumed a crucial role in several digital
companies by directly affecting their key performance indicators. Nowadays, in this era of big …

Collaborative filtering bandits

S Li, A Karatzoglou, C Gentile - … of the 39th International ACM SIGIR …, 2016‏ - dl.acm.org
Classical collaborative filtering, and content-based filtering methods try to learn a static
recommendation model given training data. These approaches are far from ideal in highly …

Reinforcement learning for personalization: A systematic literature review

F Den Hengst, EM Grua, A el Hassouni… - Data …, 2020‏ - journals.sagepub.com
The major application areas of reinforcement learning (RL) have traditionally been game
playing and continuous control. In recent years, however, RL has been increasingly applied …

Field study in deploying restless multi-armed bandits: Assisting non-profits in improving maternal and child health

A Mate, L Madaan, A Taneja, N Madhiwalla… - Proceedings of the …, 2022‏ - ojs.aaai.org
The widespread availability of cell phones has enabled non-profits to deliver critical health
information to their beneficiaries in a timely manner. This paper describes our work to assist …

Federated linear contextual bandits

R Huang, W Wu, J Yang… - Advances in neural …, 2021‏ - proceedings.neurips.cc
This paper presents a novel federated linear contextual bandits model, where individual
clients face different $ K $-armed stochastic bandits coupled through common global …

Dynamically expandable graph convolution for streaming recommendation

B He, X He, Y Zhang, R Tang, C Ma - … of the ACM Web Conference 2023, 2023‏ - dl.acm.org
Personalized recommender systems have been widely studied and deployed to reduce
information overload and satisfy users' diverse needs. However, conventional …

Coordinate Descent Method for -means

F Nie, J Xue, D Wu, R Wang, H Li… - IEEE Transactions on …, 2021‏ - ieeexplore.ieee.org
-means method using Lloyd heuristic is a traditional clustering method which has played a
key role in multiple downstream tasks of machine learning because of its simplicity …

Explaining the success of nearest neighbor methods in prediction

GH Chen, D Shah - Foundations and Trends® in Machine …, 2018‏ - nowpublishers.com
Many modern methods for prediction leverage nearest neighbor search to find past training
examples most similar to a test example, an idea that dates back in text to at least the 11th …