- Academic Search

S Verma, Y Zhao, S Shah, N Boehmer… - The 40th Conference …, 2024 - openreview.net

Restless multi-arm bandits (RMABs) are a model for sequentially allocating a limited number
of resources to agents modeled as Markov Decision Processes. RMABs have applications in …

Lagre Referanse Sitert av 7 Beslektede artikler Alle 6 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

The bandit whisperer: Communication learning for restless bandits

Y Zhao, T Wang, D Nagaraj, A Taneja… - arxiv preprint arxiv …, 2024 - arxiv.org

Applying Reinforcement Learning (RL) to Restless Multi-Arm Bandits (RMABs) offers a
promising avenue for addressing allocation problems with resource constraints and …

Lagre Referanse Sitert av 5 Beslektede artikler Alle 4 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] ifaamas.org

[PDF][PDF] Beyond" to act or not to act": Fast lagrangian approaches to general multi-action restless bandits

JA Killian, A Perrault, M Tambe - Proceedings of the 20th International …, 2021 - ifaamas.org

Beyond "To Act or Not to Act": Fast Lagrangian Approaches to General Multi-Action Restless
Bandits Page 1 Beyond "To Act or Not to Act": Fast Lagrangian Approaches to General Multi-Action …

Lagre Referanse Sitert av 36 Beslektede artikler Alle 14 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Indexability is not enough for whittle: Improved, near-optimal algorithms for restless bandits

A Ghosh, D Nagaraj, M Jain, M Tambe - arxiv preprint arxiv:2211.00112, 2022 - arxiv.org

We study the problem of planning restless multi-armed bandits (RMABs) with multiple
actions. This is a popular model for multi-agent systems with applications like multi-channel …

Lagre Referanse Sitert av 14 Beslektede artikler Alle 9 versjoner HTML-versjon

Evaluation of real-time predictive spectrum sharing for cognitive radar

JA Kovarskiy, BH Kirk, AF Martone… - … on Aerospace and …, 2020 - ieeexplore.ieee.org

The growing demand for radio frequency (RF) spectrum access poses new challenges for
next-generation radar systems. To operate in a crowded electromagnetic environment …

Lagre Referanse Sitert av 30 Beslektede artikler Alle 4 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] ifaamas.org

[PDF][PDF] Towards zero shot learning in restless multi-armed bandits

Y Zhao, N Behari, E Hughes, E Zhang… - Proceedings of the 23rd …, 2024 - ifaamas.org

Restless multi-arm bandits (RMABs), a class of resource allocation problems with broad
application in areas such as healthcare, online advertising, and anti-poaching, have recently …

Lagre Referanse Sitert av 5 Beslektede artikler Alle 5 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Improving the prediction of individual engagement in recommendations using cognitive models

R Seow, Y Zhao, D Wood, M Tambe… - arxiv preprint arxiv …, 2024 - arxiv.org

For public health programs with limited resources, the ability to predict how behaviors
change over time and in response to interventions is crucial for deciding when and to whom …

Lagre Referanse Sitert av 4 Beslektede artikler Alle 3 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Equitable restless multi-armed bandits: a general framework inspired by digital health

JA Killian, M Jain, Y Jia, J Amar, E Huang… - arxiv preprint arxiv …, 2023 - arxiv.org

Restless multi-armed bandits (RMABs) are a popular framework for algorithmic decision
making in sequential settings with limited resources. RMABs are increasingly being used for …

Lagre Referanse Sitert av 8 Beslektede artikler Alle 2 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Fairness of exposure in online restless multi-armed bandits

A Sood, S Jain, S Gujar - arxiv preprint arxiv:2402.06348, 2024 - arxiv.org

Restless multi-armed bandits (RMABs) generalize the multi-armed bandits where each arm
exhibits Markovian behavior and transitions according to their transition dynamics. Solutions …

Lagre Referanse Sitert av 2 Beslektede artikler Alle 6 versjoner HTML-versjon

Beam Alignment in Multipath Environments for Integrated Sensing and Communication Using Bandit Learning

A Sneh, SS Ram, SJ Darak… - IEEE Journal of Selected …, 2024 - ieeexplore.ieee.org

Prior works have explored multi-armed bandit (MAB) algorithms for the selection of optimal
beams for millimeter-wave (mmW) communications between base station and mobile users …

Lagre Referanse Beslektede artikler Alle 2 versjoner

Opprett varsel

Referanse

Avansert søk

Lagret i Mitt bibliotek

Transfer restless multi-armed bandit policy for energy-efficient heterogeneous cellular network

Group fairness in predict-then-optimize settings for restless bandits

The bandit whisperer: Communication learning for restless bandits

[PDF][PDF] Beyond" to act or not to act": Fast lagrangian approaches to general multi-action restless bandits

Indexability is not enough for whittle: Improved, near-optimal algorithms for restless bandits

Evaluation of real-time predictive spectrum sharing for cognitive radar

[PDF][PDF] Towards zero shot learning in restless multi-armed bandits

Improving the prediction of individual engagement in recommendations using cognitive models

Equitable restless multi-armed bandits: a general framework inspired by digital health

Fairness of exposure in online restless multi-armed bandits

Beam Alignment in Multipath Environments for Integrated Sensing and Communication Using Bandit Learning