5G support for industrial IoT applications—challenges, solutions, and research gaps

P Varga, J Peto, A Franko, D Balla, D Haja, F Janky… - Sensors, 2020 - mdpi.com
Industrial IoT has special communication requirements, including high reliability, low
latency, flexibility, and security. These are instinctively provided by the 5G mobile …

Empowering azure storage with {RDMA}

W Bai, SS Abdeen, A Agrawal, KK Attre, P Bahl… - … USENIX Symposium on …, 2023 - usenix.org
Given the wide adoption of disaggregated storage in public clouds, networking is the key to
enabling high performance and high reliability in a cloud storage service. In Azure, we …

A taxonomy of live migration management in cloud computing

T He, R Buyya - ACM Computing Surveys, 2023 - dl.acm.org
Cloud Data Centers have become the key infrastructure for providing services. Instance
migration across different computing nodes in edge and cloud computing is essential to …

Disaggregating persistent memory and controlling them remotely: An exploration of passive disaggregated {Key-Value} stores

SY Tsai, Y Shan, Y Zhang - 2020 USENIX Annual Technical Conference …, 2020 - usenix.org
Many datacenters and clouds manage storage systems separately from computing services
for better manageability and resource utilization. These existing disaggregated storage …

A Survey of Storage Systems in the RDMA era

S Ma, T Ma, K Chen, Y Wu - IEEE Transactions on Parallel and …, 2022 - ieeexplore.ieee.org
Remote Direct Memory Access (RDMA) based network devices are increasingly being
deployed in modern data centers. RDMA brings significant performance improvements over …

Transparent {GPU} sharing in container clouds for deep learning workloads

B Wu, Z Zhang, Z Bai, X Liu, X ** - 20th USENIX Symposium on …, 2023 - usenix.org
Containers are widely used for resource management in datacenters. A common practice to
support deep learning (DL) training in container clouds is to statically bind GPUs to …

High-throughput and flexible host networking for accelerated computing

A Skiadopoulos, Z **e, M Zhao, Q Cai… - … USENIX Symposium on …, 2024 - usenix.org
Modern network hardware is able to meet the stringent bandwidth demands of applications
like GPU-accelerated AI. However, existing host network stacks offer a hard tradeoff …

Collie: Finding Performance Anomalies in {RDMA} Subsystems

X Kong, Y Zhu, H Zhou, Z Jiang, J Ye, C Guo… - … USENIX Symposium on …, 2022 - usenix.org
High-speed RDMA networks are getting rapidly adopted in the industry for their low latency
and reduced CPU overheads. To verify that RDMA can be used in production, system …

Slim:{OS} Kernel Support for a {Low-Overhead} Container Overlay Network

D Zhuo, K Zhang, Y Zhu, HH Liu, M Rockett… - … USENIX Symposium on …, 2019 - usenix.org
Containers have become the de facto method for hosting large-scale distributed
applications. Container overlay networks are essential to providing portability for containers …

Understanding {RDMA} microarchitecture resources for performance isolation

X Kong, J Chen, W Bai, Y Xu, M Elhaddad… - … USENIX Symposium on …, 2023 - usenix.org
Recent years have witnessed the wide adoption of RDMA in the cloud to accelerate first-
party workloads and achieve cost savings by freeing up CPU cycles. Now cloud providers …