FedVARP: Tackling the variance due to partial client participation in federated learning

D Jhunjhunwala, P Sharma… - Uncertainty in …, 2022 - proceedings.mlr.press
Data-heterogeneous federated learning (FL) systems suffer from two significant sources of
convergence error: 1) client drift error caused by performing multiple local optimization steps …
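
Since this snippet describes the error from partial client participation, below is a minimal sketch, not the paper's FedVARP algorithm, of how sampling only a subset of clients per round makes the server update a noisier estimate of the full-participation average; all names and sizes are illustrative assumptions.

```python
# Minimal sketch (NOT FedVARP itself): the averaged update from a sampled
# subset of clients is an unbiased but higher-variance estimate of the
# full-participation average. All sizes below are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
num_clients, dim, num_sampled = 100, 10, 10

# Hypothetical per-client updates for one round (e.g., results of local SGD).
client_updates = rng.normal(size=(num_clients, dim))

full_avg = client_updates.mean(axis=0)                    # full participation
chosen = rng.choice(num_clients, size=num_sampled, replace=False)
partial_avg = client_updates[chosen].mean(axis=0)         # partial participation

# This gap is the participation variance the paper aims to remove.
print(np.linalg.norm(partial_avg - full_avg))
```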

EF21 with bells & whistles: Practical algorithmic extensions of modern error feedback

I Fatkhullin, I Sokolov, E Gorbunov, Z Li… - arXiv preprint, 2021 - arxiv.org

Variance-reduced clipping for non-convex optimization

A Reisizadeh, H Li, S Das, A Jadbabaie - arXiv preprint, 2023 - arxiv.org
Gradient clipping is a standard training technique used in deep learning applications such
as large-scale language modeling to mitigate exploding gradients. Recent experimental …
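
For readers unfamiliar with the technique this abstract refers to, here is a minimal sketch of standard global-norm gradient clipping; the variance-reduced variant studied in the paper is not shown.

```python
# Minimal sketch of plain global-norm gradient clipping (not the paper's
# variance-reduced variant): rescale the gradient when its norm is too large.
import numpy as np

def clip_gradient(grad, max_norm=1.0):
    """Return grad rescaled so that ||grad|| <= max_norm."""
    norm = np.linalg.norm(grad)
    if norm > max_norm:
        grad = grad * (max_norm / norm)
    return grad

g = np.array([3.0, 4.0])                # norm 5
print(clip_gradient(g, max_norm=1.0))   # rescaled to norm 1
```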

Decentralized stochastic gradient descent ascent for finite-sum minimax problems

H Gao - arXiv preprint arXiv:2212.02724, 2022 - arxiv.org
Minimax optimization problems have attracted significant attention in recent years due to
their widespread application in numerous machine learning models. To solve the minimax …
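
As context for the minimax setting, here is a minimal single-machine sketch of gradient descent ascent (GDA) for min_x max_y f(x, y) on an illustrative toy quadratic; the paper's decentralized, finite-sum method additionally coordinates updates across a network, which is not shown.

```python
# Minimal sketch of gradient descent ascent (GDA) on the illustrative toy
# saddle function f(x, y) = x*y + 0.5*x**2 - 0.5*y**2 (convex in x, concave
# in y). The decentralized algorithms in the paper are far more involved.

def grad_x(x, y):  # df/dx of the toy f
    return y + x

def grad_y(x, y):  # df/dy of the toy f
    return x - y

x, y, lr = 1.0, 1.0, 0.1
for _ in range(100):
    # Simultaneous update: descend in x, ascend in y.
    x, y = x - lr * grad_x(x, y), y + lr * grad_y(x, y)
print(x, y)  # approaches the saddle point (0, 0)
```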

CANITA: Faster rates for distributed convex optimization with communication compression

Z Li, P Richtárik - Advances in Neural Information …, 2021 - proceedings.neurips.cc
Due to the high communication cost in distributed and federated learning, methods relying
on compressed communication are becoming increasingly popular. Besides, the best …
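
As a concrete example of the compressed communication such methods rely on, here is a minimal sketch of top-k sparsification, one standard compression operator; CANITA's acceleration and its specific compressor assumptions are not shown.

```python
# Minimal sketch of top-k sparsification, a common communication compressor:
# only the k largest-magnitude coordinates (values + indices) need to be
# transmitted. Generic background, not CANITA itself.
import numpy as np

def top_k(vec, k):
    """Zero out all but the k largest-magnitude entries of vec."""
    out = np.zeros_like(vec)
    idx = np.argpartition(np.abs(vec), -k)[-k:]
    out[idx] = vec[idx]
    return out

g = np.array([0.1, -2.0, 0.3, 1.5, -0.05])
print(top_k(g, k=2))   # keeps -2.0 and 1.5, zeros the rest
```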

DASHA: Distributed nonconvex optimization with communication compression, optimal oracle complexity, and no client synchronization

A Tyurin, P Richtárik - arXiv preprint arXiv:2202.01268, 2022 - arxiv.org
We develop and analyze DASHA: a new family of methods for nonconvex distributed
optimization problems. When the local functions at the nodes have a finite-sum or an …

FedPAGE: A fast local stochastic gradient method for communication-efficient federated learning

H Zhao, Z Li, P Richtárik - arXiv preprint arXiv:2108.04755, 2021 - arxiv.org
Federated Averaging (FedAvg, also known as Local-SGD) (McMahan et al., 2017) is a
classical federated learning algorithm in which clients run multiple local SGD steps before …
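
Since this snippet describes the FedAvg template itself, here is a minimal sketch of FedAvg / Local-SGD on illustrative quadratic client objectives: each client runs several local SGD steps from the global model, and the server averages the results. FedPAGE's refinements are not shown.

```python
# Minimal sketch of FedAvg / Local-SGD as the abstract describes it (not
# FedPAGE): clients take multiple local SGD steps, then the server averages.
# Client objectives ||w - targets[i]||^2 and all sizes are illustrative.
import numpy as np

rng = np.random.default_rng(0)
num_clients, dim, local_steps, lr = 5, 3, 10, 0.1
targets = rng.normal(size=(num_clients, dim))

w_global = np.zeros(dim)
for _round in range(20):
    local_models = []
    for i in range(num_clients):
        w = w_global.copy()
        for _ in range(local_steps):            # multiple local SGD steps
            w -= lr * 2.0 * (w - targets[i])    # gradient of the local quadratic
        local_models.append(w)
    w_global = np.mean(local_models, axis=0)    # server-side averaging

print(w_global)   # approaches targets.mean(axis=0) on this toy problem
```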

Simple and optimal stochastic gradient methods for nonsmooth nonconvex optimization

Z Li, J Li - Journal of Machine Learning Research, 2022 - jmlr.org
We propose and analyze several stochastic gradient algorithms for finding stationary points
or local minima in nonconvex, possibly nonsmooth-regularized, finite-sum and online …
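
As background for the nonsmooth-regularizer setting, here is a minimal sketch of one proximal stochastic gradient step with the L1 regularizer, whose proximal map has the closed form known as soft-thresholding; the specific algorithms proposed in the paper are not shown.

```python
# Minimal sketch of one proximal SGD step for F(w) = f(w) + lam * ||w||_1,
# generic background rather than the paper's algorithms.
import numpy as np

def prox_l1(v, thresh):
    """prox_{thresh * ||.||_1}(v): soft-thresholding."""
    return np.sign(v) * np.maximum(np.abs(v) - thresh, 0.0)

w = np.array([0.5, -1.2, 0.05])
g = np.array([0.1, -0.3, 0.2])       # a stochastic gradient of the smooth part f
lr, lam = 0.5, 0.1
w = prox_l1(w - lr * g, lr * lam)    # gradient step on f, then prox of lam*||.||_1
print(w)                             # [0.4, -1.0, 0.0]
```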

Jointly improving the sample and communication complexities in decentralized stochastic minimax optimization

X Zhang, G Mancino-Ball, NS Aybat, Y Xu - Proceedings of the AAAI …, 2024 - ojs.aaai.org
We propose a novel single-loop decentralized algorithm, DGDA-VR, for solving
stochastic nonconvex strongly-concave minimax problems over a connected network of …

DESTRESS: Computation-optimal and communication-efficient decentralized nonconvex finite-sum optimization

B Li, Z Li, Y Chi - SIAM Journal on Mathematics of Data Science, 2022 - SIAM
Emerging applications in multiagent environments such as internet-of-things, networked
sensing, autonomous systems, and federated learning, call for decentralized algorithms for …