A comprehensive survey on coded distributed computing: Fundamentals, challenges, and networking applications
Distributed computing has become a common approach for large-scale computation tasks
due to benefits such as high reliability, scalability, computation speed, and cost …
due to benefits such as high reliability, scalability, computation speed, and cost …
Speeding up distributed machine learning using codes
Codes are widely used in many engineering applications to offer robustness against noise.
In large-scale systems, there are several types of noise that can affect the performance of …
In large-scale systems, there are several types of noise that can affect the performance of …
Coded computation over heterogeneous clusters
In large-scale distributed computing clusters, such as Amazon EC2, there are several types
of “system noise” that can result in major degradation of performance: system failures …
of “system noise” that can result in major degradation of performance: system failures …
High-dimensional coded matrix multiplication
Coded computation is a framework for providing redundancy in distributed computing
systems to make them robust to slower nodes, or stragglers. In [1], the authors propose a …
systems to make them robust to slower nodes, or stragglers. In [1], the authors propose a …
A unified coded deep neural network training strategy based on generalized polydot codes
This paper has two main contributions. First, we propose a novel coding technique-
Generalized PolyDot-for matrix-vector products that advances on existing techniques for …
Generalized PolyDot-for matrix-vector products that advances on existing techniques for …
Exploitation of stragglers in coded computation
In cloud computing systems slow processing nodes, often referred to as “stragglers”, can
significantly extend the computation time. Recent results have shown that error correction …
significantly extend the computation time. Recent results have shown that error correction …
Hierarchical coded computation
Coded computation is a method to mitigate “stragglers” in distributed computing systems
through the use of error correction coding that has lately received significant attention. First …
through the use of error correction coding that has lately received significant attention. First …
Coded distributed computing for inverse problems
Computationally intensive distributed and parallel computing is often bottlenecked by a
small set of slow workers known as stragglers. In this paper, we utilize the emerging idea …
small set of slow workers known as stragglers. In this paper, we utilize the emerging idea …
On batch-processing based coded computing for heterogeneous distributed computing systems
In recent years, coded distributed computing (CDC) has attracted significant attention,
because it can efficiently facilitate many delay-sensitive computation tasks against …
because it can efficiently facilitate many delay-sensitive computation tasks against …
Coded federated learning for communication-efficient edge computing: A survey
In the era of artificial intelligence and big data, the demand for data processing has surged,
leading to larger datasets and computation capability. Distributed machine learning (DML) …
leading to larger datasets and computation capability. Distributed machine learning (DML) …