Group fairness in predict-then-optimize settings for restless bandits

S Verma, Y Zhao, S Shah, N Boehmer… - The 40th Conference …, 2024 - openreview.net
Restless multi-arm bandits (RMABs) are a model for sequentially allocating a limited number
of resources to agents modeled as Markov Decision Processes. RMABs have applications in …

The bandit whisperer: Communication learning for restless bandits

Y Zhao, T Wang, D Nagaraj, A Taneja… - arxiv preprint arxiv …, 2024 - arxiv.org
Applying Reinforcement Learning (RL) to Restless Multi-Arm Bandits (RMABs) offers a
promising avenue for addressing allocation problems with resource constraints and …

[PDF][PDF] Beyond" to act or not to act": Fast lagrangian approaches to general multi-action restless bandits

JA Killian, A Perrault, M Tambe - Proceedings of the 20th International …, 2021 - ifaamas.org
Beyond "To Act or Not to Act": Fast Lagrangian Approaches to General Multi-Action Restless
Bandits Page 1 Beyond "To Act or Not to Act": Fast Lagrangian Approaches to General Multi-Action …

Indexability is not enough for whittle: Improved, near-optimal algorithms for restless bandits

A Ghosh, D Nagaraj, M Jain, M Tambe - arxiv preprint arxiv:2211.00112, 2022 - arxiv.org
We study the problem of planning restless multi-armed bandits (RMABs) with multiple
actions. This is a popular model for multi-agent systems with applications like multi-channel …

Evaluation of real-time predictive spectrum sharing for cognitive radar

JA Kovarskiy, BH Kirk, AF Martone… - … on Aerospace and …, 2020 - ieeexplore.ieee.org
The growing demand for radio frequency (RF) spectrum access poses new challenges for
next-generation radar systems. To operate in a crowded electromagnetic environment …

[PDF][PDF] Towards zero shot learning in restless multi-armed bandits

Y Zhao, N Behari, E Hughes, E Zhang… - Proceedings of the 23rd …, 2024 - ifaamas.org
Restless multi-arm bandits (RMABs), a class of resource allocation problems with broad
application in areas such as healthcare, online advertising, and anti-poaching, have recently …

Improving the prediction of individual engagement in recommendations using cognitive models

R Seow, Y Zhao, D Wood, M Tambe… - arxiv preprint arxiv …, 2024 - arxiv.org
For public health programs with limited resources, the ability to predict how behaviors
change over time and in response to interventions is crucial for deciding when and to whom …

Equitable restless multi-armed bandits: a general framework inspired by digital health

JA Killian, M Jain, Y Jia, J Amar, E Huang… - arxiv preprint arxiv …, 2023 - arxiv.org
Restless multi-armed bandits (RMABs) are a popular framework for algorithmic decision
making in sequential settings with limited resources. RMABs are increasingly being used for …

Fairness of exposure in online restless multi-armed bandits

A Sood, S Jain, S Gujar - arxiv preprint arxiv:2402.06348, 2024 - arxiv.org
Restless multi-armed bandits (RMABs) generalize the multi-armed bandits where each arm
exhibits Markovian behavior and transitions according to their transition dynamics. Solutions …

Beam Alignment in Multipath Environments for Integrated Sensing and Communication Using Bandit Learning

A Sneh, SS Ram, SJ Darak… - IEEE Journal of Selected …, 2024 - ieeexplore.ieee.org
Prior works have explored multi-armed bandit (MAB) algorithms for the selection of optimal
beams for millimeter-wave (mmW) communications between base station and mobile users …