Towards a comprehensive framework for the multidisciplinary evaluation of organizational maturity on business continuity program management: A systematic …

N Russo, L Reis, C Silveira… - … Security Journal: A Global …, 2024 - Taylor & Francis
Organizational dependency on Information and Communication Technology (ICT) drives the
preparedness challenge to cope with business process disruptions. Business Continuity …

Non-stationary bandits with auto-regressive temporal dependency

Q Chen, N Golrezaei… - Advances in Neural …, 2023 - proceedings.neurips.cc
Traditional multi-armed bandit (MAB) frameworks, predominantly examined under stochastic
or adversarial settings, often overlook the temporal dynamics inherent in many real-world …

Continual learning as computationally constrained reinforcement learning

S Kumar, H Marklund, A Rao, Y Zhu, HJ Jeon… - arxiv preprint arxiv …, 2023 - arxiv.org
An agent that efficiently accumulates knowledge to develop increasingly sophisticated skills
over a long lifetime could advance the frontier of artificial intelligence capabilities. The …

Stochastic rising bandits

AM Metelli, F Trovo, M Pirola… - … Conference on Machine …, 2022 - proceedings.mlr.press
This paper is in the field of stochastic Multi-Armed Bandits (MABs), ie, those sequential
selection techniques able to learn online using only the feedback given by the chosen …

An information-theoretic analysis of nonstationary bandit learning

S Min, D Russo - International Conference on Machine …, 2023 - proceedings.mlr.press
In nonstationary bandit learning problems, the decision-maker must continually gather
information and adapt their action selection as the latent state of the environment evolves. In …

Non stationary multi-armed bandit: Empirical evaluation of a new concept drift-aware algorithm

E Cavenaghi, G Sottocornola, F Stella, M Zanker - Entropy, 2021 - mdpi.com
The Multi-Armed Bandit (MAB) problem has been extensively studied in order to address
real-world challenges related to sequential decision making. In this setting, an agent selects …

Nonstationary bandit learning via predictive sampling

Y Liu, B Van Roy, K Xu - International Conference on …, 2023 - proceedings.mlr.press
Thompson sampling has proven effective across a wide range of stationary bandit
environments. However, as we demonstrate in this paper, it can perform poorly when …

Smooth non-stationary bandits

S Jia, Q **e, N Kallus, PI Frazier - … Conference on Machine …, 2023 - proceedings.mlr.press
In many applications of online decision making, the environment is non-stationary and it is
therefore crucial to use bandit algorithms that handle changes. Most existing approaches …

Which LLM to Play? Convergence-Aware Online Model Selection with Time-Increasing Bandits

Y **a, F Kong, T Yu, L Guo, RA Rossi, S Kim… - Proceedings of the ACM …, 2024 - dl.acm.org
Web-based applications such as chatbots, search engines and news recommendations
continue to grow in scale and complexity with the recent surge in the adoption of large …

On limited-memory subsampling strategies for bandits

D Baudry, Y Russac, O Cappé - International Conference on …, 2021 - proceedings.mlr.press
There has been a recent surge of interest in non-parametric bandit algorithms based on
subsampling. One drawback however of these approaches is the additional complexity …