A survey on scheduling techniques in computing and network convergence

S Tang, Y Yu, H Wang, G Wang, W Chen… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
The computing demand for massive applications has led to the ubiquitous deployment of
computing power. This trend results in the urgent need for higher-level computing resource …

Swift: Delay is simple and effective for congestion control in the datacenter

G Kumar, N Dukkipati, K Jang, HMG Wassel… - Proceedings of the …, 2020 - dl.acm.org
We report on experiences with Swift congestion control in Google datacenters. Swift targets
an end-to-end delay by using AIMD control, with pacing under extreme congestion. With …

PINT: Probabilistic in-band network telemetry

R Ben Basat, S Ramanathan, Y Li, G Antichi… - Proceedings of the …, 2020 - dl.acm.org
Commodity network devices support adding in-band telemetry measurements into data
packets, enabling a wide range of applications, including network troubleshooting …

In-band network telemetry: A survey

L Tan, W Su, W Zhang, J Lv, Z Zhang, J Miao, X Liu… - Computer Networks, 2021 - Elsevier
With the development of software-defined network and programmable data-plane
technology, in-band network telemetry has emerged. In-band network telemetry technology …

Classic meets modern: A pragmatic learning-based congestion control for the internet

S Abbasloo, CY Yen, HJ Chao - … of the Annual conference of the ACM …, 2020 - dl.acm.org
These days, taking the revolutionary approach of using clean-slate learning-based designs
to completely replace the classic congestion control schemes for the Internet is gaining …

An exhaustive survey on p4 programmable data plane switches: Taxonomy, applications, challenges, and future trends

EF Kfoury, J Crichigno, E Bou-Harb - IEEE access, 2021 - ieeexplore.ieee.org
Traditionally, the data plane has been designed with fixed functions to forward packets using
a small set of protocols. This closed-design paradigm has limited the capability of the …

Understanding host network stack overheads

Q Cai, S Chaudhary, M Vuppalapati, J Hwang… - Proceedings of the …, 2021 - dl.acm.org
Traditional end-host network stacks are struggling to keep up with rapidly increasing
datacenter access link bandwidths due to their unsustainable CPU overheads. Motivated by …

Rdma over ethernet for distributed training at meta scale

A Gangidi, R Miao, S Zheng, SJ Bondu… - Proceedings of the …, 2024 - dl.acm.org
The rapid growth in both computational density and scale in AI models in recent years
motivates the construction of an efficient and reliable dedicated network infrastructure. This …

CocoSketch: High-performance sketch-based measurement over arbitrary partial key query

Y Zhang, Z Liu, R Wang, T Yang, J Li, R Miao… - Proceedings of the …, 2021 - dl.acm.org
Sketch-based measurement has emerged as a promising alternative to the traditional
sampling-based network measurement approaches due to its high accuracy and resource …

Flow event telemetry on programmable data plane

Y Zhou, C Sun, HH Liu, R Miao, S Bai, B Li… - Proceedings of the …, 2020 - dl.acm.org
Network performance anomalies (NPAs), eg long-tailed latency, bandwidth decline, etc., are
increasingly crucial to cloud providers as applications are getting more sensitive to …