A tutorial on thompson sampling

DJ Russo, B Van Roy, A Kazerouni… - … and Trends® in …, 2018 - nowpublishers.com
Thompson sampling is an algorithm for online decision problems where actions are taken
sequentially in a manner that must balance between exploiting what is known to maximize …

Fashion recommendation systems, models and methods: A review

S Chakraborty, MS Hoque, N Rahman Jeem… - Informatics, 2021 - mdpi.com
In recent years, the textile and fashion industries have witnessed an enormous amount of
growth in fast fashion. On e-commerce platforms, where numerous choices are available, an …

Poem: Out-of-distribution detection with posterior sampling

Y Ming, Y Fan, Y Li - International Conference on Machine …, 2022 - proceedings.mlr.press
Abstract Out-of-distribution (OOD) detection is indispensable for machine learning models
deployed in the open world. Recently, the use of an auxiliary outlier dataset during training …

Neural thompson sampling

W Zhang, D Zhou, L Li, Q Gu - arxiv preprint arxiv:2010.00827, 2020 - arxiv.org
Thompson Sampling (TS) is one of the most effective algorithms for solving contextual multi-
armed bandit problems. In this paper, we propose a new algorithm, called Neural Thompson …

[LIBRO][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com
Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

DRN: A deep reinforcement learning framework for news recommendation

G Zheng, F Zhang, Z Zheng, Y **ang, NJ Yuan… - Proceedings of the …, 2018 - dl.acm.org
In this paper, we propose a novel Deep Reinforcement Learning framework for news
recommendation. Online personalized news recommendation is a highly challenging …

Towards conversational recommender systems

K Christakopoulou, F Radlinski… - Proceedings of the 22nd …, 2016 - dl.acm.org
People often ask others for restaurant recommendations as a way to discover new dining
experiences. This makes restaurant recommendation an exciting scenario for recommender …

Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

N Silva, H Werneck, T Silva, ACM Pereira… - Expert Systems with …, 2022 - Elsevier
Abstract Recommender Systems (RSs) have assumed a crucial role in several digital
companies by directly affecting their key performance indicators. Nowadays, in this era of big …

Collaborative filtering bandits

S Li, A Karatzoglou, C Gentile - … of the 39th International ACM SIGIR …, 2016 - dl.acm.org
Classical collaborative filtering, and content-based filtering methods try to learn a static
recommendation model given training data. These approaches are far from ideal in highly …

A deep reinforcement learning based long-term recommender system

L Huang, M Fu, F Li, H Qu, Y Liu, W Chen - Knowledge-based systems, 2021 - Elsevier
Recommender systems aim to maximize the overall accuracy for long-term
recommendations. However, most of the existing recommendation models adopt a static …