Flagger: Cooperative acceleration for large-scale cross-silo federated learning aggregation

X Pan, Y An, S Liang, B Mao, M Zhang… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Cross-silo federated learning (FL) leverages homomorphic encryption (HE) to obscure the
model updates from the clients. However, HE poses the challenges of complex …

SafeBPF: Hardware-assisted defense-in-depth for eBPF kernel extensions

SY Lim, T Prasad, X Han, T Pasquier - … of the 2024 on Cloud Computing …, 2024 - dl.acm.org
The eBPF framework enables execution of user-provided code in the Linux kernel. In the last
few years, a large ecosystem of cloud services has leveraged eBPF to enhance container …

ScalaCache: Scalable User-Space Page Cache Management with Software-Hardware Coordination

L Peng, Y An, Y Zhou, C Wang, Q Li, C Cheng… - 2024 USENIX Annual …, 2024 - usenix.org
Due to the host-centric design principle, the existing page cache management suffers from
CPU consumption, communication costs, and garbage collection (GC) interference. To …

Understanding Performance of eBPF Maps

C Liu, B Tak, L Wang - Proceedings of the ACM SIGCOMM 2024 …, 2024 - dl.acm.org
The Linux community has witnessed the rapid development of eBPF technology that allows
users to load custom programs into the Linux kernel to extend its capabilities. A key feature …

OmniCache: Collaborative Caching for Near-storage Accelerators

J Zhang, Y Ren, M Nguyen, C Min… - 22nd USENIX Conference …, 2024 - usenix.org
We propose OmniCache, a novel caching design for near-storage accelerators that
combines near-storage and host memory capabilities to accelerate I/O and data processing …

ProckStore: An NDP-empowered key-value store with asynchronous and multi-threaded compaction scheme for optimized performance

H Sun, C Zhao, Y Yue, X Qin - Journal of Systems Architecture, 2025 - Elsevier
With the exponential growth of large-scale unstructured data, LSM-tree-based key–value
(KV) stores have become increasingly prevalent in storage systems. However, KV stores …

Storage Abstractions for SSDs: The Past, Present, and Future

X Zhang, J Bhimani, S Pei, E Lee, S Lee… - ACM Transactions on …, 2025 - dl.acm.org
This article traces the evolution of SSD (solid-state drive) interfaces, examining the transition
from the block storage paradigm inherited from hard disk drives to SSD-specific standards …

Kgent: Kernel extensions large language model agent

Y Zheng, Y Yang, M Chen, A Quinn - Proceedings of the ACM SIGCOMM …, 2024 - dl.acm.org
The extended Berkeley Packet Filters (eBPF) ecosystem allows for the extension of Linux
and Windows kernels, but writing eBPF programs is challenging due to the required …

DeepTM: Efficient Tensor Management in Heterogeneous Memory for DNN Training

H Zhou, W Rang, H Chen, X Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) have gained widespread adoption in diverse fields,
including image classification, object detection, and natural language processing. However …

Low-overhead General-purpose Near-Data Processing in CXL Memory Expanders

H Ham, J Hong, G Park, Y Shin, O Woo, W Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
To overcome the memory capacity wall of large-scale AI and big data applications, Compute
Express Link (CXL) enables cost-efficient memory expansion beyond the local DRAM of …