The programmable data plane: Abstractions, architectures, algorithms, and applications

O Michel, R Bifulco, G Retvari, S Schmid - ACM Computing Surveys …, 2021 - dl.acm.org
Programmable data plane technologies enable the systematic reconfiguration of the low-
level processing steps applied to network packets and are key drivers toward realizing the …

Empowering cloud computing with network acceleration: a survey

L Rosa, L Foschini, A Corradi - IEEE Communications Surveys …, 2024 - ieeexplore.ieee.org
Modern interactive and data-intensive applications must operate under demanding time
constraints, prompting a shift toward the adoption of specialized software and hardware …

Direct access,{High-Performance} memory disaggregation with {DirectCXL}

D Gouk, S Lee, M Kwon, M Jung - 2022 USENIX Annual Technical …, 2022 - usenix.org
New cache coherent interconnects such as CXL have recently attracted great attention
thanks to their excellent hardware heterogeneity management and resource disaggregation …

A unified architecture for accelerating distributed {DNN} training in heterogeneous {GPU/CPU} clusters

Y Jiang, Y Zhu, C Lan, B Yi, Y Cui, C Guo - 14th USENIX Symposium on …, 2020 - usenix.org
Data center clusters that run DNN training jobs are inherently heterogeneous. They have
GPUs and CPUs for computation and network bandwidth for distributed training. However …

Netcache: Balancing key-value stores with fast in-network caching

X **, X Li, H Zhang, R Soulé, J Lee, N Foster… - Proceedings of the 26th …, 2017 - dl.acm.org
We present NetCache, a new key-value store architecture that leverages the power and
flexibility of new-generation programmable switches to handle queries on hot items and …

Clio: A hardware-software co-designed disaggregated memory system

Z Guo, Y Shan, X Luo, Y Huang, Y Zhang - Proceedings of the 27th ACM …, 2022 - dl.acm.org
Memory disaggregation has attracted great attention recently because of its benefits in
efficient memory utilization and ease of management. So far, memory disaggregation …

Shenango: Achieving high {CPU} efficiency for latency-sensitive datacenter workloads

A Ousterhout, J Fried, J Behrens, A Belay… - … USENIX Symposium on …, 2019 - usenix.org
Datacenter applications demand microsecond-scale tail latencies and high request rates
from operating systems, and most applications handle loads that have high variance over …

{AIFM}:{High-Performance},{Application-Integrated} far memory

Z Ruan, M Schwarzkopf, MK Aguilera… - 14th USENIX Symposium …, 2020 - usenix.org
Memory is the most contended and least elastic resource in datacenter servers today.
Applications can use only local memory—which may be scarce—even though memory …

Datacenter {RPCs} can be general and fast

A Kalia, M Kaminsky, D Andersen - 16th USENIX Symposium on …, 2019 - usenix.org
It is commonly believed that datacenter networking software must sacrifice generality to
attain high performance. The popularity of specialized distributed systems designed …

The demikernel datapath os architecture for microsecond-scale datacenter systems

I Zhang, A Raybuck, P Patel, K Olynyk… - Proceedings of the …, 2021 - dl.acm.org
Datacenter systems and I/O devices now run at single-digit microsecond latencies, requiring
ns-scale operating systems. Traditional kernel-based operating systems impose an …