Advances in asynchronous parallel and distributed optimization

M Assran, A Aytekin, HR Feyzmahdavian… - Proceedings of the …, 2020 - ieeexplore.ieee.org
Motivated by large-scale optimization problems arising in machine learning, researchers
have made several advances in the study of asynchronous parallel and distributed …

First analysis of local GD on heterogeneous data

A Khaled, K Mishchenko, P Richtárik - arXiv preprint arXiv:1909.04715, 2019 - arxiv.org
We provide the first convergence analysis of local gradient descent for minimizing the
average of smooth and convex but otherwise arbitrary functions. Problems of this form and …
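The snippet cuts off, but the method being analyzed is simple to state: each of M workers runs a few plain gradient-descent steps on its own local objective, and the resulting iterates are periodically averaged. Below is a minimal NumPy sketch of that local GD loop; the heterogeneous quadratic local objectives, step size, and round counts are illustrative assumptions, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
d, M = 10, 5                                  # dimension, number of workers
# Heterogeneous local objectives f_i(x) = 0.5 * ||A_i x - b_i||^2 (illustrative data)
A = [rng.standard_normal((20, d)) for _ in range(M)]
b = [rng.standard_normal(20) for _ in range(M)]

def local_grad(i, x):
    """Gradient of worker i's local objective."""
    return A[i].T @ (A[i] @ x - b[i])

def local_gd(rounds=50, local_steps=5, lr=0.01):
    x = np.zeros(d)                           # shared model
    for _ in range(rounds):
        iterates = []
        for i in range(M):                    # each worker starts from the shared model
            xi = x.copy()
            for _ in range(local_steps):      # H local gradient steps on f_i only
                xi -= lr * local_grad(i, xi)
            iterates.append(xi)
        x = np.mean(iterates, axis=0)         # communication round: average the iterates
    return x

x_hat = local_gd()
avg_grad = np.mean([local_grad(i, x_hat) for i in range(M)], axis=0)
print("norm of average gradient:", np.linalg.norm(avg_grad))
```

The point of the analysis is how the number of local steps between averaging rounds interacts with the heterogeneity of the f_i; the sketch only fixes the structure of the iteration.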

Asynchronous SGD beats minibatch SGD under arbitrary delays

K Mishchenko, F Bach, M Even… - Advances in Neural …, 2022 - proceedings.neurips.cc
The existing analysis of asynchronous stochastic gradient descent (SGD) degrades
dramatically when any delay is large, giving the impression that performance depends …
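The object of study here is the delayed update x_{t+1} = x_t - lr * g(x_{t - tau_t}), where the gradient is computed at a stale iterate. A toy single-process simulation of that update follows; the bounded-delay distribution, least-squares objective, and step size are my own illustrative choices, not the paper's arbitrary-delay setting.

```python
import numpy as np
from collections import deque

rng = np.random.default_rng(1)
d = 10
A = rng.standard_normal((100, d))
b = rng.standard_normal(100)

def stoch_grad(x):
    """Single-sample stochastic gradient of 0.5*||Ax - b||^2 / n (illustrative objective)."""
    j = rng.integers(len(b))
    return A[j] * (A[j] @ x - b[j])

def async_sgd(steps=2000, lr=0.005, max_delay=20):
    x = np.zeros(d)
    history = deque([x.copy()], maxlen=max_delay + 1)   # past iterates workers may have read
    for _ in range(steps):
        tau = rng.integers(len(history))                # random delay (bounded in this toy)
        stale_x = history[-1 - tau]                     # gradient is computed at a stale iterate
        x = x - lr * stoch_grad(stale_x)                # server applies the delayed gradient
        history.append(x.copy())
    return x

x_hat = async_sgd()
print("final objective:", 0.5 * np.mean((A @ x_hat - b) ** 2))
```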

Predicting dynamic spectrum allocation: a review covering simulation, modelling, and prediction

AC Cullen, BIP Rubinstein, S Kandeepan… - Artificial Intelligence …, 2023 - Springer
The advent of the Internet of Things and 5G has further accelerated the growth in devices
attempting to gain access to the wireless spectrum. A consequence of this has been the …

Asynchronous SGD on graphs: a unified framework for asynchronous decentralized and federated optimization

M Even, A Koloskova… - … Conference on Artificial …, 2024 - proceedings.mlr.press
Decentralized and asynchronous communications are two popular techniques for reducing the
communication complexity of distributed machine learning, by respectively removing the …
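To fix ideas about the decentralized side, here is a synchronous gossip-SGD sketch on a ring graph, where each node takes a local stochastic gradient step and then averages with its neighbours through a mixing matrix W. The unified framework in the paper also covers asynchronous communication patterns, which this toy synchronous loop does not model; graph, data, and step size are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
d, n = 10, 8                                  # dimension, number of nodes
A = [rng.standard_normal((30, d)) for _ in range(n)]
b = [rng.standard_normal(30) for _ in range(n)]

# Doubly stochastic mixing matrix for a ring: average with both neighbours.
W = np.zeros((n, n))
for i in range(n):
    W[i, i] = 0.5
    W[i, (i - 1) % n] = 0.25
    W[i, (i + 1) % n] = 0.25

def stoch_grad(i, x):
    """Single-sample stochastic gradient of node i's local least-squares objective."""
    j = rng.integers(len(b[i]))
    return A[i][j] * (A[i][j] @ x - b[i][j])

def decentralized_sgd(steps=3000, lr=0.01):
    X = np.zeros((n, d))                      # one model per node
    for _ in range(steps):
        G = np.stack([stoch_grad(i, X[i]) for i in range(n)])
        X = W @ (X - lr * G)                  # local step followed by gossip averaging
    return X.mean(axis=0)

x_hat = decentralized_sgd()
print("average objective:",
      0.5 * np.mean([np.mean((A[i] @ x_hat - b[i]) ** 2) for i in range(n)]))
```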

Stochastic Newton and cubic Newton methods with simple local linear-quadratic rates

D Kovalev, K Mishchenko, P Richtárik - arXiv preprint arXiv:1912.01597, 2019 - arxiv.org
We present two new, remarkably simple stochastic second-order methods for minimizing the
average of a very large number of sufficiently smooth and strongly convex functions. The first …
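A hedged sketch of the stochastic Newton iteration studied in this line of work: keep a stored point w_i for every function, take a Newton-type step built from the Hessians and gradients at the stored points, then refresh only a random subset of the w_i. The logistic-regression data, batch size, and the naive recomputation of all terms each step are my simplifications; an efficient implementation would maintain running sums instead.

```python
import numpy as np

rng = np.random.default_rng(3)
d, n, mu = 5, 50, 0.1
A = rng.standard_normal((n, d))
y = rng.choice([-1.0, 1.0], size=n)

def grad_i(i, w):
    """Gradient of f_i(w) = log(1 + exp(-y_i a_i^T w)) + (mu/2)||w||^2."""
    s = 1.0 / (1.0 + np.exp(y[i] * A[i] @ w))
    return -y[i] * s * A[i] + mu * w

def hess_i(i, w):
    """Hessian of f_i at w."""
    p = 1.0 / (1.0 + np.exp(-y[i] * A[i] @ w))
    return p * (1.0 - p) * np.outer(A[i], A[i]) + mu * np.eye(d)

def stochastic_newton(steps=200, batch=5):
    W = np.zeros((n, d))                     # stored point w_i for every function
    x = np.zeros(d)
    for _ in range(steps):
        H = np.mean([hess_i(i, W[i]) for i in range(n)], axis=0)
        v = np.mean([hess_i(i, W[i]) @ W[i] - grad_i(i, W[i]) for i in range(n)], axis=0)
        x = np.linalg.solve(H, v)            # Newton-type step built from the stored points
        for i in rng.choice(n, size=batch, replace=False):
            W[i] = x                         # refresh only a random subset of stored points
    return x

x_hat = stochastic_newton()
full_grad = np.mean([grad_i(i, x_hat) for i in range(n)], axis=0)
print("norm of full gradient:", np.linalg.norm(full_grad))
```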

Moshpit SGD: Communication-efficient decentralized training on heterogeneous unreliable devices

M Ryabinin, E Gorbunov… - Advances in …, 2021 - proceedings.neurips.cc
Training deep neural networks on large datasets can often be accelerated by using multiple
compute nodes. This approach, known as distributed training, can utilize hundreds of …

Optimal time complexities of parallel stochastic optimization methods under a fixed computation model

A Tyurin, P Richtárik - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Parallelization is a popular strategy for improving the performance of methods. Optimization
methods are no exception: the design of efficient parallel optimization methods and tight …

Adaptive catalyst for smooth convex optimization

A Ivanova, D Pasechnyuk, D Grishchenko… - … on Optimization and …, 2021 - Springer
In this paper, we present a generic framework for accelerating almost any non-accelerated
deterministic or randomized algorithm for smooth convex optimization …
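Catalyst-style acceleration wraps a non-accelerated inner solver in a sequence of regularized proximal subproblems with Nesterov-type extrapolation of the prox centers. The sketch below uses plain gradient descent as the inner solver and a fixed regularization parameter kappa; the adaptive choice of kappa, which is the point of this paper, is not modeled, and the objective and parameters are illustrative.

```python
import numpy as np

rng = np.random.default_rng(4)
d = 20
A = rng.standard_normal((100, d))
b = rng.standard_normal(100)

def grad_f(x):
    """Gradient of the smooth convex objective f(x) = 0.5*||Ax - b||^2 / n (illustrative)."""
    return A.T @ (A @ x - b) / len(b)

def inner_gd(grad, x0, lr, iters):
    """Non-accelerated base method: plain gradient descent on the subproblem."""
    x = x0.copy()
    for _ in range(iters):
        x -= lr * grad(x)
    return x

def catalyst(outer=30, inner=50, kappa=1.0):
    L = np.linalg.norm(A, 2) ** 2 / len(b)          # smoothness constant of f
    q = 0.0                                         # treat f as merely convex
    x = np.zeros(d)
    y = x.copy()
    alpha = 1.0
    for _ in range(outer):
        # Approximately minimize f(z) + (kappa/2)||z - y||^2 with the base method.
        sub_grad = lambda z, y=y: grad_f(z) + kappa * (z - y)
        x_new = inner_gd(sub_grad, x, lr=1.0 / (L + kappa), iters=inner)
        # Nesterov-style extrapolation of the prox centers.
        alpha_new = 0.5 * (q - alpha**2 + np.sqrt((alpha**2 - q) ** 2 + 4 * alpha**2))
        beta = alpha * (1 - alpha) / (alpha**2 + alpha_new)
        y = x_new + beta * (x_new - x)
        x, alpha = x_new, alpha_new
    return x

x_hat = catalyst()
print("||grad f||:", np.linalg.norm(grad_f(x_hat)))
```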

Robust distributed accelerated stochastic gradient methods for multi-agent networks

A Fallah, M Gürbüzbalaban, A Ozdaglar… - Journal of machine …, 2022 - jmlr.org
We study the distributed stochastic gradient (D-SG) method and its accelerated variant (D-ASG)
for solving decentralized strongly convex stochastic optimization problems where the …
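One common form of the accelerated variant adds Nesterov-style extrapolation on top of the gossip-plus-gradient structure used in the decentralized SGD sketch earlier in this list. The version below follows that generic pattern rather than the paper's exact parameterization; the ring network, data, step size, and momentum value are assumptions.

```python
import numpy as np

rng = np.random.default_rng(5)
d, n = 10, 6                                  # dimension, number of agents
A = [rng.standard_normal((40, d)) for _ in range(n)]
b = [rng.standard_normal(40) for _ in range(n)]

# Doubly stochastic mixing matrix for a ring network of agents.
W = np.zeros((n, n))
for i in range(n):
    W[i, i] = 0.5
    W[i, (i - 1) % n] = 0.25
    W[i, (i + 1) % n] = 0.25

def stoch_grad(i, x):
    """Single-sample stochastic gradient of agent i's regularized least-squares loss."""
    j = rng.integers(len(b[i]))
    return A[i][j] * (A[i][j] @ x - b[i][j]) + 0.1 * x   # l2 term makes the loss strongly convex

def d_asg(steps=3000, lr=0.003, beta=0.7):
    X = np.zeros((n, d))                      # agents' iterates
    Y = np.zeros((n, d))                      # extrapolated (momentum) iterates
    for _ in range(steps):
        G = np.stack([stoch_grad(i, Y[i]) for i in range(n)])
        X_new = W @ Y - lr * G                # consensus step on Y plus local stochastic gradient
        Y = X_new + beta * (X_new - X)        # Nesterov-style extrapolation per agent
        X = X_new
    return X.mean(axis=0)

x_hat = d_asg()
print("average loss:",
      np.mean([0.5 * np.mean((A[i] @ x_hat - b[i]) ** 2) for i in range(n)]))
```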