Google Académico

DJ Russo, B Van Roy, A Kazerouni… - … and Trends® in …, 2018 - nowpublishers.com

Thompson sampling is an algorithm for online decision problems where actions are taken
sequentially in a manner that must balance between exploiting what is known to maximize …

Guardar Citar Citado por 1293 Artículos relacionados Las 34 versiones Búsqueda de bibliotecas Versión en HTML

[Free GPT-4]

[PDF] mdpi.com

Fashion recommendation systems, models and methods: A review

S Chakraborty, MS Hoque, N Rahman Jeem… - Informatics, 2021 - mdpi.com

In recent years, the textile and fashion industries have witnessed an enormous amount of
growth in fast fashion. On e-commerce platforms, where numerous choices are available, an …

Guardar Citar Citado por 78 Artículos relacionados Las 4 versiones En caché

[Free GPT-4]

[PDF] mlr.press

Poem: Out-of-distribution detection with posterior sampling

Y Ming, Y Fan, Y Li - International Conference on Machine …, 2022 - proceedings.mlr.press

Abstract Out-of-distribution (OOD) detection is indispensable for machine learning models
deployed in the open world. Recently, the use of an auxiliary outlier dataset during training …

Guardar Citar Citado por 116 Artículos relacionados Las 4 versiones Versión en HTML

[Free GPT-4]

[PDF] arxiv.org

Neural thompson sampling

W Zhang, D Zhou, L Li, Q Gu - arxiv preprint arxiv:2010.00827, 2020 - arxiv.org

Thompson Sampling (TS) is one of the most effective algorithms for solving contextual multi-
armed bandit problems. In this paper, we propose a new algorithm, called Neural Thompson …

Guardar Citar Citado por 280 Artículos relacionados Las 8 versiones Versión en HTML

[Free GPT-4]

[PDF] tor-lattimore.com

[LIBRO][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com

Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

Guardar Citar Citado por 3281 Artículos relacionados Las 9 versiones Búsqueda de bibliotecas

[Free GPT-4]

[PDF] acm.org

DRN: A deep reinforcement learning framework for news recommendation

G Zheng, F Zhang, Z Zheng, Y **ang, NJ Yuan… - Proceedings of the …, 2018 - dl.acm.org

In this paper, we propose a novel Deep Reinforcement Learning framework for news
recommendation. Online personalized news recommendation is a highly challenging …

Guardar Citar Citado por 888 Artículos relacionados Las 6 versiones

[Free GPT-4]

[PDF] microsoft.com

Towards conversational recommender systems

K Christakopoulou, F Radlinski… - Proceedings of the 22nd …, 2016 - dl.acm.org

People often ask others for restaurant recommendations as a way to discover new dining
experiences. This makes restaurant recommendation an exciting scenario for recommender …

Guardar Citar Citado por 514 Artículos relacionados Las 7 versiones

Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

N Silva, H Werneck, T Silva, ACM Pereira… - Expert Systems with …, 2022 - Elsevier

Abstract Recommender Systems (RSs) have assumed a crucial role in several digital
companies by directly affecting their key performance indicators. Nowadays, in this era of big …

Guardar Citar Citado por 81 Artículos relacionados Las 2 versiones

[Free GPT-4]

[PDF] arxiv.org

Collaborative filtering bandits

S Li, A Karatzoglou, C Gentile - … of the 39th International ACM SIGIR …, 2016 - dl.acm.org

Classical collaborative filtering, and content-based filtering methods try to learn a static
recommendation model given training data. These approaches are far from ideal in highly …

Guardar Citar Citado por 422 Artículos relacionados Las 7 versiones

A deep reinforcement learning based long-term recommender system

L Huang, M Fu, F Li, H Qu, Y Liu, W Chen - Knowledge-based systems, 2021 - Elsevier

Recommender systems aim to maximize the overall accuracy for long-term
recommendations. However, most of the existing recommendation models adopt a static …

Guardar Citar Citado por 114 Artículos relacionados Las 3 versiones

Crear alerta

Citar

Búsqueda avanzada

Guardado en Mi biblioteca

Efficient Thompson Sampling for Online Matrix-Factorization Recommendation

A tutorial on thompson sampling

Fashion recommendation systems, models and methods: A review

Poem: Out-of-distribution detection with posterior sampling

Neural thompson sampling

[LIBRO][B] Bandit algorithms

DRN: A deep reinforcement learning framework for news recommendation

Towards conversational recommender systems

Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

Collaborative filtering bandits

A deep reinforcement learning based long-term recommender system