Machine learning at the wireless edge: Distributed stochastic gradient descent over-the-air

MM Amiri, D Gündüz - IEEE Transactions on Signal Processing, 2020 - ieeexplore.ieee.org
We study federated machine learning (ML) at the wireless edge, where power-and
bandwidth-limited wireless devices with local datasets carry out distributed stochastic …

Computation scheduling for distributed machine learning with straggling workers

MM Amiri, D Gündüz - IEEE Transactions on Signal Processing, 2019 - ieeexplore.ieee.org
We study scheduling of computation tasks across n workers in a large scale distributed
learning problem with the help of a master. Computation and communication delays are …

Stochastic gradient coding for straggler mitigation in distributed learning

R Bitar, M Wootters… - IEEE Journal on Selected …, 2020 - ieeexplore.ieee.org
We consider distributed gradient descent in the presence of stragglers. Recent work on
gradient coding and approximate gradient coding have shown how to add redundancy in …

A double auction mechanism for resource allocation in coded vehicular edge computing

JS Ng, WYB Lim, Z **ong, D Niyato… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
The development of smart vehicles and rich cloud services have led to the emergence of
vehicular edge computing. To perform the distributed computation tasks efficiently, Coded …

Coded sparse matrix computation schemes that leverage partial stragglers

AB Das, A Ramamoorthy - IEEE Transactions on Information …, 2022 - ieeexplore.ieee.org
Distributed matrix computations over large clusters can suffer from the problem of slow or
failed worker nodes (called stragglers) which can dominate the overall job execution time …

Coding for large-scale distributed machine learning

M **ao, M Skoglund - Entropy, 2022 - mdpi.com
This article aims to give a comprehensive and rigorous review of the principles and recent
development of coding for large-scale distributed machine learning (DML). With increasing …

Numerically stable coded matrix computations via circulant and rotation matrix embeddings

A Ramamoorthy, L Tang - IEEE Transactions on Information …, 2021 - ieeexplore.ieee.org
Polynomial based methods have recently been used in several works for mitigating the
effect of stragglers (slow or failed nodes) in distributed matrix computations. For a system …

Coded distributed computing with partial recovery

E Ozfatura, S Ulukus, D Gündüz - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Coded computation techniques provide robustness against straggling workers in distributed
computing. However, most of the existing schemes require exact provisioning of the …

Straggler-aware distributed learning: Communication–computation latency trade-off

E Ozfatura, S Ulukus, D Gündüz - Entropy, 2020 - mdpi.com
When gradient descent (GD) is scaled to many parallel workers for large-scale machine
learning applications, its per-iteration computation time is limited by straggling workers …

Universally decodable matrices for distributed matrix-vector multiplication

A Ramamoorthy, L Tang… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
Coded computation is an emerging research area that leverages concepts from erasure
coding to mitigate the effect of stragglers (slow nodes) in distributed computation clusters …