A tutorial on thompson sampling

DJ Russo, B Van Roy, A Kazerouni… - … and Trends® in …, 2018 - nowpublishers.com
Thompson sampling is an algorithm for online decision problems where actions are taken
sequentially in a manner that must balance between exploiting what is known to maximize …

Click-through rate prediction in online advertising: A literature review

Y Yang, P Zhai - Information Processing & Management, 2022 - Elsevier
Predicting the probability that a user will click on a specific advertisement has been a
prevalent issue in online advertising, attracting much research attention in the past decades …

Neural thompson sampling

W Zhang, D Zhou, L Li, Q Gu - arxiv preprint arxiv:2010.00827, 2020 - arxiv.org
Thompson Sampling (TS) is one of the most effective algorithms for solving contextual multi-
armed bandit problems. In this paper, we propose a new algorithm, called Neural Thompson …

Autoint: Automatic feature interaction learning via self-attentive neural networks

W Song, C Shi, Z **ao, Z Duan, Y Xu, M Zhang… - Proceedings of the 28th …, 2019 - dl.acm.org
Click-through rate (CTR) prediction, which aims to predict the probability of a user clicking
on an ad or an item, is critical to many online applications such as online advertising and …

[ΒΙΒΛΙΟ][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com
Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

[ΒΙΒΛΙΟ][B] Mathematics for machine learning

MP Deisenroth, AA Faisal, CS Ong - 2020 - books.google.com
The fundamental mathematical tools needed to understand machine learning include linear
algebra, analytic geometry, matrix decompositions, vector calculus, optimization, probability …

DeepFM: a factorization-machine based neural network for CTR prediction

H Guo, R Tang, Y Ye, Z Li, X He - arxiv preprint arxiv:1703.04247, 2017 - arxiv.org
Learning sophisticated feature interactions behind user behaviors is critical in maximizing
CTR for recommender systems. Despite great progress, existing methods seem to have a …

Product-based neural networks for user response prediction

Y Qu, H Cai, K Ren, W Zhang, Y Yu… - 2016 IEEE 16th …, 2016 - ieeexplore.ieee.org
Predicting user responses, such as clicks and conversions, is of great importance and has
found its usage inmany Web applications including recommender systems, websearch and …

Clipper: A {Low-Latency} online prediction serving system

D Crankshaw, X Wang, G Zhou, MJ Franklin… - … USENIX Symposium on …, 2017 - usenix.org
Clipper: A Low-Latency Online Prediction Serving System Page 1 This paper is included in the
Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation …

FiBiNET: combining feature importance and bilinear feature interaction for click-through rate prediction

T Huang, Z Zhang, J Zhang - Proceedings of the 13th ACM conference …, 2019 - dl.acm.org
Advertising and feed ranking are essential to many Internet companies such as Facebook
and Sina Weibo. Among many real-world advertising and feed ranking systems, click …