A survey on data center networking (DCN): Infrastructure and operations
Data centers (DCs), owing to the exponential growth of Internet services, have emerged as
an irreplaceable and crucial infrastructure to power this ever-growing trend. A DC typically …
an irreplaceable and crucial infrastructure to power this ever-growing trend. A DC typically …
Datacenter traffic control: Understanding techniques and tradeoffs
M Noormohammadpour… - … Surveys & Tutorials, 2017 - ieeexplore.ieee.org
Datacenters provide cost-effective and flexible access to scalable compute and storage
resources necessary for today's cloud computing needs. A typical datacenter is made up of …
resources necessary for today's cloud computing needs. A typical datacenter is made up of …
Swift: Delay is simple and effective for congestion control in the datacenter
We report on experiences with Swift congestion control in Google datacenters. Swift targets
an end-to-end delay by using AIMD control, with pacing under extreme congestion. With …
an end-to-end delay by using AIMD control, with pacing under extreme congestion. With …
Fast distributed inference serving for large language models
Large language models (LLMs) power a new generation of interactive AI applications
exemplified by ChatGPT. The interactive nature of these applications demands low latency …
exemplified by ChatGPT. The interactive nature of these applications demands low latency …
Homa: A receiver-driven low-latency transport protocol using network priorities
Homa is a new transport protocol for datacenter networks. It provides exceptionally low
latency, especially for workloads with a high volume of very short messages, and it also …
latency, especially for workloads with a high volume of very short messages, and it also …
Re-architecting datacenter networks and stacks for low latency and high performance
Modern datacenter networks provide very high capacity via redundant Clos topologies and
low switch latency, but transport protocols rarely deliver matching performance. We present …
low switch latency, but transport protocols rarely deliver matching performance. We present …
Auto: Scaling deep reinforcement learning for datacenter-scale automatic traffic optimization
Traffic optimizations (TO, eg flow scheduling, load balancing) in datacenters are difficult
online decision-making problems. Previously, they are done with heuristics relying on …
online decision-making problems. Previously, they are done with heuristics relying on …
TIMELY: RTT-based congestion control for the datacenter
Datacenter transports aim to deliver low latency messaging together with high throughput.
We show that simple packet delay, measured as round-trip times at hosts, is an effective …
We show that simple packet delay, measured as round-trip times at hosts, is an effective …
CONGA: Distributed congestion-aware load balancing for datacenters
M Alizadeh, T Edsall, S Dharmapurikar… - Proceedings of the …, 2014 - dl.acm.org
We present the design, implementation, and evaluation of CONGA, a network-based
distributed congestion-aware load balancing mechanism for datacenters. CONGA exploits …
distributed congestion-aware load balancing mechanism for datacenters. CONGA exploits …
pFabric: Minimal near-optimal datacenter transport
In this paper we present pFabric, a minimalistic datacenter transport design that provides
near theoretically optimal flow completion times even at the 99th percentile for short flows …
near theoretically optimal flow completion times even at the 99th percentile for short flows …