In-network machine learning using programmable network devices: A survey

C Zheng, X Hong, D Ding, S Vargaftik… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
Machine learning is widely used to solve networking challenges, ranging from traffic
classification and anomaly detection to network configuration. However, machine learning …

Offloading machine learning to programmable data planes: A systematic survey

R Parizotto, BL Coelho, DC Nunes, I Haque… - ACM Computing …, 2023 - dl.acm.org
The demand for machine learning (ML) has increased significantly in recent decades,
enabling several applications, such as speech recognition, computer vision, and …

Scaling distributed machine learning with {In-Network} aggregation

A Sapio, M Canini, CY Ho, J Nelson, P Kalnis… - … USENIX Symposium on …, 2021 - usenix.org
Training machine learning models in parallel is an increasingly important workload. We
accelerate distributed parallel training by designing a communication primitive that uses a …

Cooperative exploration for multi-agent deep reinforcement learning

IJ Liu, U Jain, RA Yeh… - … conference on machine …, 2021 - proceedings.mlr.press
Exploration is critical for good results in deep reinforcement learning and has attracted much
attention. However, existing multi-agent deep reinforcement learning algorithms still use …

Do switches dream of machine learning? toward in-network classification

Z **ong, N Zilberman - Proceedings of the 18th ACM workshop on hot …, 2019 - dl.acm.org
Machine learning is currently driving a technological and societal revolution. While
programmable switches have been proven to be useful for in-network computing, machine …

Efficient sparse collective communication and its application to accelerate distributed deep learning

J Fei, CY Ho, AN Sahu, M Canini, A Sapio - Proceedings of the 2021 …, 2021 - dl.acm.org
Efficient collective communication is crucial to parallel-computing applications such as
distributed training of large-scale recommendation systems and natural language …

IIsy: Practical in-network classification

C Zheng, Z **ong, TT Bui, S Kaupmees… - arxiv preprint arxiv …, 2022 - arxiv.org
The rat race between user-generated data and data-processing systems is currently won by
data. The increased use of machine learning leads to further increase in processing …

Think fast: A tensor streaming processor (TSP) for accelerating deep learning workloads

D Abts, J Ross, J Sparling… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
In this paper, we introduce the Tensor Streaming Processor (TSP) architecture, a functionally-
sliced microarchitecture with memory units interleaved with vector and matrix deep learning …

Programmable switches for in-networking classification

BM Xavier, RS Guimarães, G Comarela… - … -IEEE Conference on …, 2021 - ieeexplore.ieee.org
Deploying accurate machine learning algorithms into a high-throughput networking
environment is a challenging task. On the one hand, machine learning has proved itself …

Line-speed and scalable intrusion detection at the network edge via federated learning

Q Qin, K Poularakis, KK Leung… - 2020 IFIP networking …, 2020 - ieeexplore.ieee.org
Intrusion detection through classifying incoming packets is a crucial functionality at the
network edge, requiring accuracy, efficiency and scalability at the same time, introducing a …