PyTorch: An imperative style, high-performance deep learning library

A Paszke, S Gross, F Massa, A Lerer… - Advances in neural …, 2019 - proceedings.neurips.cc
Deep learning frameworks have often focused on either usability or speed, but not both.
PyTorch is a machine learning library that shows that these two goals are in fact compatible …

NOVA: A log-structured file system for hybrid volatile/non-volatile main memories

J Xu, S Swanson - 14th USENIX Conference on File and Storage …, 2016 - usenix.org
Fast non-volatile memories (NVMs) will soon appear on the processor memory bus
alongside DRAM. The resulting hybrid memory systems will provide software with sub …

The Demikernel datapath OS architecture for microsecond-scale datacenter systems

I Zhang, A Raybuck, P Patel, K Olynyk… - Proceedings of the …, 2021 - dl.acm.org
Datacenter systems and I/O devices now run at single-digit microsecond latencies, requiring
ns-scale operating systems. Traditional kernel-based operating systems impose an …

FaRM: Fast remote memory

A Dragojević, D Narayanan, M Castro… - 11th USENIX Symposium …, 2014 - usenix.org
We describe the design and implementation of FaRM, a new main memory distributed
computing platform that exploits RDMA to improve both latency and throughput by an order …

FlatStore: An efficient log-structured key-value storage engine for persistent memory

Y Chen, Y Lu, F Yang, Q Wang, Y Wang… - Proceedings of the Twenty …, 2020 - dl.acm.org
Emerging hardware like persistent memory (PM) and high-speed NICs are promising to
build efficient key-value stores. However, we observe that the small-sized access pattern in …

A lightweight infrastructure for graph analytics

D Nguyen, A Lenharth, K Pingali - Proceedings of the twenty-fourth ACM …, 2013 - dl.acm.org
Several domain-specific languages (DSLs) for parallel graph analytics have been proposed
recently. In this paper, we argue that existing DSLs can be implemented on top of a general …

The RAMCloud storage system

J Ousterhout, A Gopalan, A Gupta, A Kejriwal… - ACM Transactions on …, 2015 - dl.acm.org
RAMCloud is a storage system that provides low-latency access to large-scale datasets. To
achieve low latency, RAMCloud stores all data in DRAM at all times. To support large …

Mnemosyne: Lightweight persistent memory

H Volos, AJ Tack, MM Swift - ACM SIGARCH Computer Architecture …, 2011 - dl.acm.org
New storage-class memory (SCM) technologies, such as phase-change memory, STT-RAM,
and memristors, promise user-level access to non-volatile storage through regular memory …

NV-Heaps: Making persistent objects fast and safe with next-generation, non-volatile memories

J Coburn, AM Caulfield, A Akel, LM Grupp… - ACM SIGARCH …, 2011 - dl.acm.org
Persistent, user-defined objects present an attractive abstraction for working with
non-volatile program state. However, the slow speed of persistent storage (i.e., disk) has restricted …

Redesigning LSMs for nonvolatile memory with NoveLSM

S Kannan, N Bhat, A Gavrilovska… - 2018 USENIX Annual …, 2018 - usenix.org
We present NoveLSM, a persistent LSM-based key-value storage system designed to
exploit non-volatile memories and deliver low latency and high throughput to applications …