Machine learning at the wireless edge: Distributed stochastic gradient descent over-the-air
We study federated machine learning (ML) at the wireless edge, where power-and
bandwidth-limited wireless devices with local datasets carry out distributed stochastic …
bandwidth-limited wireless devices with local datasets carry out distributed stochastic …
Computation scheduling for distributed machine learning with straggling workers
We study scheduling of computation tasks across n workers in a large scale distributed
learning problem with the help of a master. Computation and communication delays are …
learning problem with the help of a master. Computation and communication delays are …
Stochastic gradient coding for straggler mitigation in distributed learning
We consider distributed gradient descent in the presence of stragglers. Recent work on
gradient coding and approximate gradient coding have shown how to add redundancy in …
gradient coding and approximate gradient coding have shown how to add redundancy in …
A double auction mechanism for resource allocation in coded vehicular edge computing
The development of smart vehicles and rich cloud services have led to the emergence of
vehicular edge computing. To perform the distributed computation tasks efficiently, Coded …
vehicular edge computing. To perform the distributed computation tasks efficiently, Coded …
Coded sparse matrix computation schemes that leverage partial stragglers
Distributed matrix computations over large clusters can suffer from the problem of slow or
failed worker nodes (called stragglers) which can dominate the overall job execution time …
failed worker nodes (called stragglers) which can dominate the overall job execution time …
Coding for large-scale distributed machine learning
This article aims to give a comprehensive and rigorous review of the principles and recent
development of coding for large-scale distributed machine learning (DML). With increasing …
development of coding for large-scale distributed machine learning (DML). With increasing …
Numerically stable coded matrix computations via circulant and rotation matrix embeddings
Polynomial based methods have recently been used in several works for mitigating the
effect of stragglers (slow or failed nodes) in distributed matrix computations. For a system …
effect of stragglers (slow or failed nodes) in distributed matrix computations. For a system …
Coded distributed computing with partial recovery
Coded computation techniques provide robustness against straggling workers in distributed
computing. However, most of the existing schemes require exact provisioning of the …
computing. However, most of the existing schemes require exact provisioning of the …
Straggler-aware distributed learning: Communication–computation latency trade-off
When gradient descent (GD) is scaled to many parallel workers for large-scale machine
learning applications, its per-iteration computation time is limited by straggling workers …
learning applications, its per-iteration computation time is limited by straggling workers …
Universally decodable matrices for distributed matrix-vector multiplication
Coded computation is an emerging research area that leverages concepts from erasure
coding to mitigate the effect of stragglers (slow nodes) in distributed computation clusters …
coding to mitigate the effect of stragglers (slow nodes) in distributed computation clusters …