LITE kernel RDMA support for datacenter applications

SY Tsai, Y Zhang - Proceedings of the 26th Symposium on Operating …, 2017 - dl.acm.org
Recently, there is an increasing interest in building data-center applications with RDMA
because of its low-latency, high-throughput, and low-CPU-utilization benefits. However …

Apus: Fast and scalable Paxos on RDMA

C Wang, J Jiang, X Chen, N Yi, H Cui - Proceedings of the 2017 …, 2017 - dl.acm.org
State machine replication (SMR) uses Paxos to enforce the same inputs for a program (e.g.,
Redis) replicated on a number of hosts, tolerating various types of failures. Unfortunately …

Hermes: A fast, fault-tolerant and linearizable replication protocol

A Katsarakis, V Gavrielatos, MRS Katebzadeh… - Proceedings of the …, 2020 - dl.acm.org
Today's datacenter applications are underpinned by datastores that are responsible for
providing availability, consistency, and performance. For high availability in the presence of …

The impact of RDMA on agreement

MK Aguilera, N Ben-David, R Guerraoui… - Proceedings of the …, 2019 - dl.acm.org
Remote Direct Memory Access (RDMA) is becoming widely available in data centers. This
technology allows a process to directly read and write the memory of a remote host, with a …

Scalog: Seamless reconfiguration and total order in a scalable shared log

C Ding, D Chu, E Zhao, X Li, L Alvisi… - … USENIX Symposium on …, 2020 - usenix.org
The shared log paradigm is at the heart of modern distributed applications in the growing
cloud computing industry. Often, application logs must be stored durably for analytics …

Recent Advancements In Distributed System Communications

I Argyroulis - arXiv preprint arXiv:2107.01381, 2021 - arxiv.org
Overheads in Operating System kernel network stacks and sockets have been hindering
OSes from managing networking operations efficiently for years. Moreover, when building …

Invalidation-based protocols for replicated datastores

A Katsarakis - arXiv preprint arXiv:2112.02405, 2021 - arxiv.org
Distributed in-memory datastores underpin cloud applications that run within a datacenter
and demand high performance, strong consistency, and availability. A key feature of …

Theoretical Foundations for Practical Concurrent and Distributed Computation

N Ben-David - 2020 - reports-archive.adm.cs.cmu.edu
Many large-scale computations are nowadays computed using several processes, whether
on a single multi-core machine, or distributed over many machines. This wide-spread use of …

On the Importance of Infrastructure-Awareness in Large-Scale Distributed Storage Systems

SMS Rizvi - 2021 - uwspace.uwaterloo.ca
Big data applications put significant latency and throughput demands on distributed storage
systems. Meeting these demands requires storage systems to use a significant amount of …

Certifying safety when implementing consensus

A Panda - arXiv preprint arXiv:1903.03501, 2019 - arxiv.org
Ensuring the correctness of distributed system implementations remains a challenging and
largely unaddressed problem. In this paper we present a protocol that can be used to certify …