Towards a comprehensive framework for the multidisciplinary evaluation of organizational maturity on business continuity program management: A systematic …

N Russo, L Reis, C Silveira… - … Security Journal: A Global …, 2024 - Taylor & Francis
Organizational dependency on Information and Communication Technology (ICT) drives the
preparedness challenge to cope with business process disruptions. Business Continuity …

Feel-good thompson sampling for contextual bandits and reinforcement learning

T Zhang - SIAM Journal on Mathematics of Data Science, 2022 - SIAM
Thompson sampling has been widely used for contextual bandit problems due to the
flexibility of its modeling power. However, a general theory for this class of methods in the …

Meta-thompson sampling

B Kveton, M Konobeev, M Zaheer… - International …, 2021 - proceedings.mlr.press
Efficient exploration in bandits is a fundamental online learning problem. We propose a
variant of Thompson sampling that learns to explore better as it interacts with bandit …

Hierarchical bayesian bandits

J Hong, B Kveton, M Zaheer… - International …, 2022 - proceedings.mlr.press
Abstract Meta-, multi-task, and federated learning can be all viewed as solving similar tasks,
drawn from a distribution that reflects task similarities. We provide a unified view of all these …

Scalable neural contextual bandit for recommender systems

Z Zhu, B Van Roy - Proceedings of the 32nd ACM International …, 2023 - dl.acm.org
High-quality recommender systems ought to deliver both innovative and relevant content
through effective and exploratory interactions with users. Yet, supervised learning-based …

No regrets for learning the prior in bandits

S Basu, B Kveton, M Zaheer… - Advances in neural …, 2021 - proceedings.neurips.cc
Abstract We propose AdaTS, a Thompson sampling algorithm that adapts sequentially to
bandit tasks that it interacts with. The key idea in AdaTS is to adapt to an unknown task prior …

Metadata-based multi-task bandits with bayesian hierarchical models

R Wan, L Ge, R Song - Advances in Neural Information …, 2021 - proceedings.neurips.cc
How to explore efficiently is a central problem in multi-armed bandits. In this paper, we
introduce the metadata-based multi-task bandit problem, where the agent needs to solve a …

Deep hierarchy in bandits

J Hong, B Kveton, S Katariya… - International …, 2022 - proceedings.mlr.press
Mean rewards of actions are often correlated. The form of these correlations may be
complex and unknown a priori, such as the preferences of users for recommended products …

Learning mixtures of linear dynamical systems

Y Chen, HV Poor - International conference on machine …, 2022 - proceedings.mlr.press
We study the problem of learning a mixture of multiple linear dynamical systems (LDSs) from
unlabeled short sample trajectories, each generated by one of the LDS models. Despite the …

Reinforcement learning for efficient and tuning-free link adaptation

V Saxena, H Tullberg, J Jaldén - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Wireless links adapt the data transmission parameters to the dynamic channel state–this is
called link adaptation. Classical link adaptation relies on tuning parameters that are …