Networking for big data: A survey
Complementary to the fancy big data applications, networking for big data is an
indispensable supporting platform for these applications in practice. This emerging research …
indispensable supporting platform for these applications in practice. This emerging research …
Load balancing in data center networks: A survey
Data center networks usually employ the scale-out model to provide high bisection
bandwidth for applications. A large amount of data is required to be transferred frequently …
bandwidth for applications. A large amount of data is required to be transferred frequently …
Efficient coflow scheduling without prior knowledge
Inter-coflow scheduling improves application-level communication performance in data-
parallel clusters. However, existing efficient schedulers require a priori coflow information …
parallel clusters. However, existing efficient schedulers require a priori coflow information …
CODA: Toward automatically identifying and scheduling coflows in the dark
Leveraging application-level requirements using coflows has recently been shown to
improve application-level communication performance in data-parallel clusters. However …
improve application-level communication performance in data-parallel clusters. However …
Sincronia: Near-optimal network design for coflows
We present Sincronia, a near-optimal network design for coflows that can be implemented
on top on any transport layer (for flows) that supports priority scheduling. Sincronia achieves …
on top on any transport layer (for flows) that supports priority scheduling. Sincronia achieves …
Network-aware locality scheduling for distributed data operators in data centers
Large data centers are currently the mainstream infrastructures for big data processing. As
one of the most fundamental tasks in these environments, the efficient execution of …
one of the most fundamental tasks in these environments, the efficient execution of …
Loom: Flexible and efficient {NIC} packet scheduling
In multi-tenant cloud data centers, operators need to ensure that competing tenants and
applications are isolated from each other and fairly share limited network resources. With …
applications are isolated from each other and fairly share limited network resources. With …
[PDF][PDF] Deepweave: Accelerating job completion time with deep reinforcement learning-based coflow scheduling
P Sun, Z Guo, J Wang, J Li, J Lan, Y Hu - Proceedings of the Twenty-Ninth …, 2021 - ijcai.org
To improve the processing efficiency of jobs in distributed computing, the concept of coflow
is proposed. A coflow is a collection of flows that are semantically correlated in a multi-stage …
is proposed. A coflow is a collection of flows that are semantically correlated in a multi-stage …
Cluster frameworks for efficient scheduling and resource allocation in data center networks: A survey
Data centers are widely used for big data analytics, which often involve data-parallel jobs,
including query and web service. Meanwhile, cluster frameworks are rapidly developed for …
including query and web service. Meanwhile, cluster frameworks are rapidly developed for …
Towards real-time inference offloading with distributed edge computing: the framework and algorithms
By combining edge computing and parallel computing, distributed edge computing has
emerged as a new paradigm to exploit the booming IoT devices at the edge. To accelerate …
emerged as a new paradigm to exploit the booming IoT devices at the edge. To accelerate …