A decision-language model (dlm) for dynamic restless multi-armed bandit tasks in public health

N Behari, E Zhang, Y Zhao, A Taneja… - arxiv preprint arxiv …, 2024 - arxiv.org
Restless multi-armed bandits (RMAB) have demonstrated success in optimizing resource
allocation for large beneficiary populations in public health settings. Unfortunately, RMAB …

Policy space response oracles: A survey

A Bighashdel, Y Wang, S McAleer, R Savani… - arxiv preprint arxiv …, 2024 - arxiv.org
Game theory provides a mathematical way to study the interaction between multiple
decision makers. However, classical game-theoretic analysis is limited in scalability due to …

Improving the prediction of individual engagement in recommendations using cognitive models

R Seow, Y Zhao, D Wood, M Tambe… - arxiv preprint arxiv …, 2024 - arxiv.org
For public health programs with limited resources, the ability to predict how behaviors
change over time and in response to interventions is crucial for deciding when and to whom …

Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI

S Tulli, SL Vasileiou, S Sreedharan - arxiv preprint arxiv:2405.07773, 2024 - arxiv.org
" Human-aware" has become a popular keyword used to describe a particular class of AI
systems that are designed to work and interact with humans. While there exists a surprising …

Uniendo la investigación con la práctica para la equidad: una descripción de la Iniciativa EAAMO

FM Cossío - Revista de Salud Ambiental, 2024 - ojs.diffundit.com
Resumen La iniciativa Equidad y Acceso en Algoritmos, Mecanismos y Optimización
(EAAMO) utiliza investigación interdisciplinaria para abordar desafíos globales, enfatizando …

Fairness for workers who pull the arms: An index based policy for allocation of restless bandit tasks

A Biswas, JA Killian, PR Diaz, S Ghosh… - arxiv preprint arxiv …, 2023 - arxiv.org
Motivated by applications such as machine repair, project monitoring, and anti-poaching
patrol scheduling, we study intervention planning of stochastic processes under resource …

From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance

J Xu, I Nazarov, A Rastogi, Á Periáñez… - arxiv preprint arxiv …, 2025 - arxiv.org
Online restless bandits extend classic contextual bandits by incorporating state transitions
and budget constraints, representing each agent as a Markov Decision Process (MDP). This …

FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits

S Chakraborty, S Roy, D Basu - arxiv preprint arxiv:2405.14038, 2024 - arxiv.org
High dimensional sparse linear bandits serve as an efficient model for sequential decision-
making problems (eg personalized medicine), where high dimensional features (eg …

[PDF][PDF] Adherence Bandits

JA Killian, A Lalan, A Mate, M Jain… - The Workshop on …, 2023 - projects.iq.harvard.edu
We define a new subclass of the restless multi-armed bandit framework, that we name
Adherence Bandits, designed to capture the dynamics prevalent in many public health …