Offloading machine learning to programmable data planes: A systematic survey
The demand for machine learning (ML) has increased significantly in recent decades,
enabling several applications, such as speech recognition, computer vision, and …
enabling several applications, such as speech recognition, computer vision, and …
In-network aggregation with transport transparency for distributed training
Recent In-Network Aggregation (INA) solutions offload the all-reduce operation onto network
switches to accelerate and scale distributed training (DT). On end hosts, these solutions …
switches to accelerate and scale distributed training (DT). On end hosts, these solutions …
NetFC: Enabling accurate floating-point arithmetic on programmable switches
Programmable switches are recently used for accelerating data-intensive distributed
applications. Some computational tasks, traditionally performed on servers in data centers …
applications. Some computational tasks, traditionally performed on servers in data centers …
P4db-the case for in-network oltp
In this paper we present a new approach for distributed DBMSs called P4DB, that uses a
programmable switch to accelerate OLTP workloads. The main idea of P4DB is that it …
programmable switch to accelerate OLTP workloads. The main idea of P4DB is that it …
A roadmap for big model
With the rapid development of deep learning, training Big Models (BMs) for multiple
downstream tasks becomes a popular paradigm. Researchers have achieved various …
downstream tasks becomes a popular paradigm. Researchers have achieved various …
Efficient data-plane memory scheduling for in-network aggregation
As the scale of distributed training grows, communication becomes a bottleneck. To
accelerate the communication, recent works introduce In-Network Aggregation (INA), which …
accelerate the communication, recent works introduce In-Network Aggregation (INA), which …
Associative memory based experience replay for deep reinforcement learning
Experience replay is an essential component in deep reinforcement learning (DRL), which
stores the experiences and generates experiences for the agent to learn in real time …
stores the experiences and generates experiences for the agent to learn in real time …
Emotion detection in instagram social media platform
LJ Sailesh, VK Kumar, K Nimala… - 2023 International …, 2023 - ieeexplore.ieee.org
Depression is regarded as an important issue because it is the largest cause of disability
around the world and a major factor in the formation of serious medical conditions, which …
around the world and a major factor in the formation of serious medical conditions, which …
Preemptive switch memory usage to accelerate training jobs with shared in-network aggregation
Recent works introduce In-Network Aggregation (INA) for distributed training (DT), which
moves the gradient summation into network programmable switches. INA can reduce the …
moves the gradient summation into network programmable switches. INA can reduce the …
NetReduce: RDMA-compatible in-network reduction for distributed DNN training acceleration
We present NetReduce, a novel RDMA-compatible in-network reduction architecture to
accelerate distributed DNN training. Compared to existing designs, NetReduce maintains a …
accelerate distributed DNN training. Compared to existing designs, NetReduce maintains a …