Google Akademik

X Li, L **e, N Li - Annual Reviews in Control, 2023 - Elsevier

Distributed online optimization and online games have been increasingly researched in the
last decade, mostly motivated by their wide applications in sensor networks, robotics (eg …

Kaydet Alıntı yap Alıntılanma sayısı: 41 İlgili makaleler

[Free GPT-4]

[PDF] mlr.press

Online meta-learning

C Finn, A Rajeswaran, S Kakade… - … on machine learning, 2019 - proceedings.mlr.press

A central capability of intelligent systems is the ability to continuously build upon previous
experiences to speed up and enhance learning of new tasks. Two distinct research …

Kaydet Alıntı yap Alıntılanma sayısı: 545 İlgili makaleler 12 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] arxiv.org

Distributed online optimization in dynamic environments using mirror descent

S Shahrampour, A Jadbabaie - IEEE Transactions on Automatic …, 2017 - ieeexplore.ieee.org

This work addresses decentralized online optimization in nonstationary environments. A
network of agents aim to track the minimizer of a global, time-varying, and convex function …

Kaydet Alıntı yap Alıntılanma sayısı: 322 İlgili makaleler 6 sürümün hepsi

[Free GPT-4]

[PDF] neurips.cc

Dynamic regret of convex and smooth functions

P Zhao, YJ Zhang, L Zhang… - Advances in Neural …, 2020 - proceedings.neurips.cc

We investigate online convex optimization in non-stationary environments and choose the
dynamic regret as the performance measure, defined as the difference between cumulative …

Kaydet Alıntı yap Alıntılanma sayısı: 105 İlgili makaleler 10 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] mlr.press

No-regret learning in time-varying zero-sum games

M Zhang, P Zhao, H Luo… - … Conference on Machine …, 2022 - proceedings.mlr.press

Learning from repeated play in a fixed two-player zero-sum game is a classic problem in
game theory and online learning. We consider a variant of this problem where the game …

Kaydet Alıntı yap Alıntılanma sayısı: 50 İlgili makaleler 7 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] mlr.press

A new algorithm for non-stationary contextual bandits: Efficient, optimal and parameter-free

Y Chen, CW Lee, H Luo… - Conference on Learning …, 2019 - proceedings.mlr.press

We propose the first contextual bandit algorithm that is parameter-free, efficient, and optimal
in terms of dynamic regret. Specifically, our algorithm achieves $\mathcal {O}(\min\{\sqrt …

Kaydet Alıntı yap Alıntılanma sayısı: 145 İlgili makaleler 7 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] neurips.cc

Adapting to online label shift with provable guarantees

Y Bai, YJ Zhang, P Zhao… - Advances in Neural …, 2022 - proceedings.neurips.cc

The standard supervised learning paradigm works effectively when training data shares the
same distribution as the upcoming testing samples. However, this stationary assumption is …

Kaydet Alıntı yap Alıntılanma sayısı: 30 İlgili makaleler 9 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] arxiv.org

Distributed online optimization for multi-agent networks with coupled inequality constraints

X Li, X Yi, L **e - IEEE Transactions on Automatic Control, 2020 - ieeexplore.ieee.org

This article investigates the distributed online optimization problem over a multi-agent
network subject to local set constraints and coupled inequality constraints, which has a lot of …

Kaydet Alıntı yap Alıntılanma sayısı: 132 İlgili makaleler 5 sürümün hepsi

[Free GPT-4]

[PDF] neurips.cc

Improved dynamic regret for non-degenerate functions

L Zhang, T Yang, J Yi, R **… - Advances in Neural …, 2017 - proceedings.neurips.cc

Recently, there has been a growing research interest in the analysis of dynamic regret,
which measures the performance of an online learner against a sequence of local …

Kaydet Alıntı yap Alıntılanma sayısı: 139 İlgili makaleler 20 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] arxiv.org

Distributed bandit online convex optimization with time-varying coupled inequality constraints

X Yi, X Li, T Yang, L **e, T Chai… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

Distributed bandit online convex optimization with time-varying coupled inequality
constraints is considered, motivated by a repeated game between a group of learners and …

Kaydet Alıntı yap Alıntılanma sayısı: 95 İlgili makaleler 7 sürümün hepsi

Uyarı oluştur

Alıntı yap

Gelişmiş arama

Kitaplığım'a kaydedildi

Tracking slowly moving clairvoyant: Optimal dynamic regret of online learning with true and...

A survey on distributed online optimization and online games

Online meta-learning

Distributed online optimization in dynamic environments using mirror descent

Dynamic regret of convex and smooth functions

No-regret learning in time-varying zero-sum games

A new algorithm for non-stationary contextual bandits: Efficient, optimal and parameter-free

Adapting to online label shift with provable guarantees

Distributed online optimization for multi-agent networks with coupled inequality constraints

Improved dynamic regret for non-degenerate functions

Distributed bandit online convex optimization with time-varying coupled inequality constraints