- Academic Search

Towards a comprehensive framework for the multidisciplinary evaluation of organizational maturity on business continuity program management: A systematic …

N Russo, L Reis, C Silveira… - … Security Journal: A Global …, 2024 - Taylor & Francis

Organizational dependency on Information and Communication Technology (ICT) drives the
preparedness challenge to cope with business process disruptions. Business Continuity …

Zapisz Cytuj Cytowane przez 15 Powiązane artykuły Wszystkie wersje 6

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Feel-good thompson sampling for contextual bandits and reinforcement learning

T Zhang - SIAM Journal on Mathematics of Data Science, 2022 - SIAM

Thompson sampling has been widely used for contextual bandit problems due to the
flexibility of its modeling power. However, a general theory for this class of methods in the …

Zapisz Cytuj Cytowane przez 72 Powiązane artykuły Wszystkie wersje 4

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Meta-thompson sampling

B Kveton, M Konobeev, M Zaheer… - International …, 2021 - proceedings.mlr.press

Efficient exploration in bandits is a fundamental online learning problem. We propose a
variant of Thompson sampling that learns to explore better as it interacts with bandit …

Zapisz Cytuj Cytowane przez 82 Powiązane artykuły Wszystkie wersje 11 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Hierarchical bayesian bandits

J Hong, B Kveton, M Zaheer… - International …, 2022 - proceedings.mlr.press

Abstract Meta-, multi-task, and federated learning can be all viewed as solving similar tasks,
drawn from a distribution that reflects task similarities. We provide a unified view of all these …

Zapisz Cytuj Cytowane przez 46 Powiązane artykuły Wszystkie wersje 4 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Scalable neural contextual bandit for recommender systems

Z Zhu, B Van Roy - Proceedings of the 32nd ACM International …, 2023 - dl.acm.org

High-quality recommender systems ought to deliver both innovative and relevant content
through effective and exploratory interactions with users. Yet, supervised learning-based …

Zapisz Cytuj Cytowane przez 15 Powiązane artykuły Wszystkie wersje 4

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

No regrets for learning the prior in bandits

S Basu, B Kveton, M Zaheer… - Advances in neural …, 2021 - proceedings.neurips.cc

Abstract We propose AdaTS, a Thompson sampling algorithm that adapts sequentially to
bandit tasks that it interacts with. The key idea in AdaTS is to adapt to an unknown task prior …

Zapisz Cytuj Cytowane przez 39 Powiązane artykuły Wszystkie wersje 7 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Metadata-based multi-task bandits with bayesian hierarchical models

R Wan, L Ge, R Song - Advances in Neural Information …, 2021 - proceedings.neurips.cc

How to explore efficiently is a central problem in multi-armed bandits. In this paper, we
introduce the metadata-based multi-task bandit problem, where the agent needs to solve a …

Zapisz Cytuj Cytowane przez 32 Powiązane artykuły Wszystkie wersje 6 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Deep hierarchy in bandits

J Hong, B Kveton, S Katariya… - International …, 2022 - proceedings.mlr.press

Mean rewards of actions are often correlated. The form of these correlations may be
complex and unknown a priori, such as the preferences of users for recommended products …

Zapisz Cytuj Cytowane przez 22 Powiązane artykuły Wszystkie wersje 6 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Learning mixtures of linear dynamical systems

Y Chen, HV Poor - International conference on machine …, 2022 - proceedings.mlr.press

We study the problem of learning a mixture of multiple linear dynamical systems (LDSs) from
unlabeled short sample trajectories, each generated by one of the LDS models. Despite the …

Zapisz Cytuj Cytowane przez 23 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reinforcement learning for efficient and tuning-free link adaptation

V Saxena, H Tullberg, J Jaldén - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Wireless links adapt the data transmission parameters to the dynamic channel state–this is
called link adaptation. Classical link adaptation relies on tuning parameters that are …

Zapisz Cytuj Cytowane przez 45 Powiązane artykuły Wszystkie wersje 6

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

Latent bandits revisited

Towards a comprehensive framework for the multidisciplinary evaluation of organizational maturity on business continuity program management: A systematic …

Feel-good thompson sampling for contextual bandits and reinforcement learning

Meta-thompson sampling

Hierarchical bayesian bandits

Scalable neural contextual bandit for recommender systems

No regrets for learning the prior in bandits

Metadata-based multi-task bandits with bayesian hierarchical models

Deep hierarchy in bandits

Learning mixtures of linear dynamical systems

Reinforcement learning for efficient and tuning-free link adaptation