INSERT: In-Network Stateful End-to-End RDMA Telemetry

H Chang, WA Hanafy, S Mukherjee… - IEEE INFOCOM 2024 …, 2024 - ieeexplore.ieee.org
Remote Direct Memory Access (RDMA) has been widely adopted in modern data centers
thanks to its high-throughput, low-latency data transfer capability and reduced CPU …

Zeta: Transparent Zero-Trust Security Add-on for RDMA

H Chang, S Mukherjee - IEEE INFOCOM 2024-IEEE …, 2024 - ieeexplore.ieee.org
While the fast adoption of RDMA in data centers has been primarily driven by its
performance benefits, more and more attention is being paid to its security implication …

Enhancing Resilience in Distributed ML Inference Pipelines for Edge Computing

L Wu, WA Hanafy, A Souza… - MILCOM 2024-2024 …, 2024 - ieeexplore.ieee.org
As edge computing and sensing devices continue to proliferate, distributed machine
learning (ML) inference pipelines are becoming popular for enabling low-latency, real-time …

Failure-Resilient ML Inference at the Edge through Graceful Service Degradation

WA Hanafy, L Wu, T Abdelzaher… - MILCOM 2023-2023 …, 2023 - ieeexplore.ieee.org
With recent innovations in machine learning (ML) technologies, especially deep learning,
many IoT applications have increasingly relied on ML models for various tasks, such as …

Toward Highly-efficient GPU-centric Networking

M Girondi - 2024 - diva-portal.org
Graphics Processing Units (GPUs) are emerging as the most popular accelerator for many
applications, powering the core of Machine Learning applications and many computing …

Experimental Evaluation of the Results of Deploying NVIDIA GPUDirect Technology on the HSE University Supercomputer

RA Chulkevich, VI Kozyrev, PS Kostenetskiy… - Moscow, 2023 - publications.hse.ru
Optimizing the use of computing resources on high-performance clusters is an
important task under heavy load. One of the ways …