A comprehensive survey on coded distributed computing: Fundamentals, challenges, and networking applications

JS Ng, WYB Lim, NC Luong, Z **ong… - … Surveys & Tutorials, 2021 - ieeexplore.ieee.org
Distributed computing has become a common approach for large-scale computation tasks
due to benefits such as high reliability, scalability, computation speed, and cost …

Speeding up distributed machine learning using codes

K Lee, M Lam, R Pedarsani… - IEEE Transactions …, 2017 - ieeexplore.ieee.org
Codes are widely used in many engineering applications to offer robustness against noise.
In large-scale systems, there are several types of noise that can affect the performance of …

Coded computation over heterogeneous clusters

A Reisizadeh, S Prakash, R Pedarsani… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
In large-scale distributed computing clusters, such as Amazon EC2, there are several types
of “system noise” that can result in major degradation of performance: system failures …

High-dimensional coded matrix multiplication

K Lee, C Suh, K Ramchandran - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Coded computation is a framework for providing redundancy in distributed computing
systems to make them robust to slower nodes, or stragglers. In [1], the authors propose a …

A unified coded deep neural network training strategy based on generalized polydot codes

S Dutta, Z Bai, H Jeong, TM Low… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
This paper has two main contributions. First, we propose a novel coding technique-
Generalized PolyDot-for matrix-vector products that advances on existing techniques for …

Exploitation of stragglers in coded computation

S Kiani, N Ferdinand, SC Draper - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
In cloud computing systems slow processing nodes, often referred to as “stragglers”, can
significantly extend the computation time. Recent results have shown that error correction …

Hierarchical coded computation

N Ferdinand, SC Draper - 2018 IEEE International Symposium …, 2018 - ieeexplore.ieee.org
Coded computation is a method to mitigate “stragglers” in distributed computing systems
through the use of error correction coding that has lately received significant attention. First …

Coded distributed computing for inverse problems

Y Yang, P Grover, S Kar - Advances in Neural Information …, 2017 - proceedings.neurips.cc
Computationally intensive distributed and parallel computing is often bottlenecked by a
small set of slow workers known as stragglers. In this paper, we utilize the emerging idea …

On batch-processing based coded computing for heterogeneous distributed computing systems

B Wang, J **e, K Lu, Y Wan, S Fu - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In recent years, coded distributed computing (CDC) has attracted significant attention,
because it can efficiently facilitate many delay-sensitive computation tasks against …

Coded federated learning for communication-efficient edge computing: A survey

Y Zhang, T Gao, C Li, CW Tan - IEEE Open Journal of the …, 2024 - ieeexplore.ieee.org
In the era of artificial intelligence and big data, the demand for data processing has surged,
leading to larger datasets and computation capability. Distributed machine learning (DML) …