Google Scholar

Tensor attention training: Provably efficient learning of higher-order transformers

Y Liang, Z Shi, Z Song, Y Zhou - arXiv preprint arXiv:2405.16411, 2024 - arxiv.org

Tensor Attention, a multi-view attention that is able to capture high-order correlations among
multiple modalities, can overcome the representational limitations of classical matrix …

Cited by 27 - 2 versions


Collaborative filtering bandits

S Li, A Karatzoglou, C Gentile - … of the 39th International ACM SIGIR …, 2016 - dl.acm.org

Classical collaborative filtering, and content-based filtering methods try to learn a static
recommendation model given training data. These approaches are far from ideal in highly …

Cited by 422 - 7 versions


Transfer learning

SJ Pan - Learning, 2020 - api.taylorfrancis.com

Supervised machine learning techniques have already been widely studied and applied to
various real-world applications. However, most existing supervised algorithms work well …

Cited by 513 - 6 versions


Online clustering of bandits

C Gentile, S Li, G Zappella - International conference on …, 2014 - proceedings.mlr.press

We introduce a novel algorithmic approach to content recommendation based on adaptive
clustering of exploration-exploitation ("bandit") strategies. We provide a sharp regret …

Cited by 332 - 13 versions


Meta-Thompson sampling

B Kveton, M Konobeev, M Zaheer… - International …, 2021 - proceedings.mlr.press

Efficient exploration in bandits is a fundamental online learning problem. We propose a
variant of Thompson sampling that learns to explore better as it interacts with bandit …

Cited by 81 - 11 versions


Bayesian decision-making under misspecified priors with applications to meta-learning

M Simchowitz, C Tosh… - Advances in …, 2021 - proceedings.neurips.cc

Thompson sampling and other Bayesian sequential decision-making algorithms are among
the most popular approaches to tackle explore/exploit trade-offs in (contextual) bandits. The …

Cited by 59 - 7 versions


On context-dependent clustering of bandits

C Gentile, S Li, P Kar, A Karatzoglou… - International …, 2017 - proceedings.mlr.press

We investigate a novel cluster-of-bandit algorithm CAB for collaborative recommendation
tasks that implements the underlying feedback sharing mechanism by estimating user …

Cited by 159 - 9 versions


Hierarchical Bayesian bandits

J Hong, B Kveton, M Zaheer… - International …, 2022 - proceedings.mlr.press

Meta-, multi-task, and federated learning can be all viewed as solving similar tasks,
drawn from a distribution that reflects task similarities. We provide a unified view of all these …

Cited by 46 - 4 versions


Multi-armed bandits for intelligent tutoring systems

B Clement, D Roy, PY Oudeyer, M Lopes - arXiv preprint arXiv:1310.3174, 2013 - arxiv.org

We present an approach to Intelligent Tutoring Systems which adaptively personalizes
sequences of learning activities to maximize skills acquired by students, taking into account …

Cited by 193 - 18 versions


Provable benefits of representational transfer in reinforcement learning

A Agarwal, Y Song, W Sun, K Wang… - The Thirty Sixth …, 2023 - proceedings.mlr.press

We study the problem of representational transfer in RL, where an agent first pretrains in a
number of source tasks to discover a shared representation, which is subsequently …

Cited by 31 - 8 versions


Sequential transfer in multi-armed bandit with finite set of models

Tensor attention training: Provably efficient learning of higher-order transformers
