The programmable data plane: Abstractions, architectures, algorithms, and applications

O Michel, R Bifulco, G Retvari, S Schmid - ACM Computing Surveys …, 2021 - dl.acm.org
Programmable data plane technologies enable the systematic reconfiguration of the low-
level processing steps applied to network packets and are key drivers toward realizing the …

Offloading machine learning to programmable data planes: A systematic survey

R Parizotto, BL Coelho, DC Nunes, I Haque… - ACM Computing …, 2023 - dl.acm.org
The demand for machine learning (ML) has increased significantly in recent decades,
enabling several applications, such as speech recognition, computer vision, and …

Scaling distributed machine learning with {In-Network} aggregation

A Sapio, M Canini, CY Ho, J Nelson, P Kalnis… - … USENIX Symposium on …, 2021 - usenix.org
Training machine learning models in parallel is an increasingly important workload. We
accelerate distributed parallel training by designing a communication primitive that uses a …

{ATP}: In-network aggregation for multi-tenant learning

CL Lao, Y Le, K Mahajan, Y Chen, W Wu… - … USENIX Symposium on …, 2021 - usenix.org
Distributed deep neural network training (DT) systems are widely deployed in clusters where
the network is shared across multiple tenants, ie, multiple DT jobs. Each DT job computes …

In-network computation is a dumb idea whose time has come

A Sapio, I Abdelaziz, A Aldilaijan, M Canini… - Proceedings of the 16th …, 2017 - dl.acm.org
Programmable data plane hardware creates new opportunities for infusing intelligence into
the network. This raises a fundamental question: what kinds of computation should be …

Netpaxos: Consensus at network speed

HT Dang, D Sciascia, M Canini, F Pedone… - Proceedings of the 1st …, 2015 - dl.acm.org
This paper explores the possibility of implementing the widely deployed Paxos consensus
protocol in network devices. We present two different approaches:(i) a detailed design …

Incbricks: Toward in-network computation with an in-network cache

M Liu, L Luo, J Nelson, L Ceze… - Proceedings of the …, 2017 - dl.acm.org
The emergence of programmable network devices and the increasing data traffic of
datacenters motivate the idea of in-network computation. By offloading compute operations …

Scalable hierarchical aggregation protocol (SHArP): A hardware architecture for efficient data reduction

RL Graham, D Bureddy, P Lui… - … in HPC (COMHPC), 2016 - ieeexplore.ieee.org
Increased system size and a greater reliance on utilizing system parallelism to achieve
computational needs, requires innovative system architectures to meet the simulation …

In-network aggregation for shared machine learning clusters

N Gebara, M Ghobadi, P Costa - Proceedings of Machine …, 2021 - proceedings.mlsys.org
We present PANAMA, a network architecture for machine learning (ML) workloads on
shared clusters where a variety of training jobs co-exist. PANAMA consists of two key …

Shieldbox: Secure middleboxes using shielded execution

B Trach, A Krohmer, F Gregor, S Arnautov… - Proceedings of the …, 2018 - dl.acm.org
Middleboxes that process confidential data cannot be securely deployed in untrusted cloud
environments. To securely outsource middleboxes to the cloud, state-of-the-art systems …