Google Scholar

Introduction to multi-armed bandits

A Slivkins - Foundations and Trends® in Machine Learning, 2019 - nowpublishers.com

Multi-armed bandits are a simple but very powerful framework for algorithms that make
decisions over time under uncertainty. An enormous body of work has accumulated over the …

Cited by 1272

An overview of deep reinforcement learning for spectrum sensing in cognitive radio networks

F Obite, AD Usman, E Okafor - Digital Signal Processing, 2021 - Elsevier

Deep reinforcement learning has recorded remarkable performance in diverse application
areas of artificial intelligence: pattern recognition, robotics, object segmentation …

Cited by 42

On improving model-free algorithms for decentralized multi-agent reinforcement learning

W Mao, L Yang, K Zhang… - … Conference on Machine …, 2022 - proceedings.mlr.press

Multi-agent reinforcement learning (MARL) algorithms often suffer from an exponential
sample complexity dependence on the number of agents, a phenomenon known as the …

Cited by 72

Distributed multi-player bandits - a game of thrones approach

I Bistritz, A Leshem - Advances in Neural Information …, 2018 - proceedings.neurips.cc

We consider a multi-armed bandit game where N players compete for K arms for T turns.
Each player has different expected rewards for the arms, and the instantaneous rewards are …

Cited by 166

Distributed learning in multi-armed bandit with multiple players

K Liu, Q Zhao - IEEE transactions on signal processing, 2010 - ieeexplore.ieee.org

We formulate and study a decentralized multi-armed bandit (MAB) problem. There are M
distributed players competing for N independent arms. Each arm, when played, offers iid …

Cited by 494

Cognitive medium access: Exploration, exploitation, and competition

L Lai, H El Gamal, H Jiang… - IEEE transactions on …, 2010 - ieeexplore.ieee.org

This paper considers the design of efficient strategies that allow cognitive users to choose
frequency bands to sense and access among multiple bands with unknown parameters …

Cited by 423

Learning multiuser channel allocations in cognitive radio networks: A combinatorial multi-armed bandit formulation

Y Gai, B Krishnamachari, R Jain - 2010 IEEE Symposium on …, 2010 - ieeexplore.ieee.org

We consider the following fundamental problem in the context of channelized dynamic
spectrum access. There are M secondary users and N ≥ M orthogonal channels. Each …

Cited by 305

Distributed multiarmed bandits

J Zhu, J Liu - IEEE Transactions on Automatic Control, 2023 - ieeexplore.ieee.org

This article studies a distributed multiarmed bandit problem with heterogeneous
observations of rewards. The problem is cooperatively solved by agents assuming each …

Cited by 20

On distributed cooperative decision-making in multiarmed bandits

P Landgren, V Srivastava… - 2016 European Control …, 2016 - ieeexplore.ieee.org

We study the explore-exploit tradeoff in distributed cooperative decision-making using the
context of the multiarmed bandit (MAB) problem. For the distributed cooperative MAB …

Cited by 91

Dominate or delete: Decentralized competing bandits in serial dictatorship

A Sankararaman, S Basu… - International …, 2021 - proceedings.mlr.press

Online learning in a two-sided matching market, with demand side agents continuously
competing to be matched with supply side (arms), abstracts the complex interactions under …

Cited by 43

Saved to Library

Medium access in cognitive radio networks: A competitive multi-armed bandit framework

Introduction to multi-armed bandits
