Halfmoon: Log-optimal fault-tolerant stateful serverless computing

S Qi, X Liu, X ** - Proceedings of the 29th Symposium on Operating …, 2023 - dl.acm.org
Serverless computing separates function execution from state management. Simple retry-
based fault tolerance might corrupt the shared state with duplicate updates. Existing …

Chop chop: Byzantine atomic broadcast to the network limit

M Camaioni, R Guerraoui, M Monti, PL Roman… - … USENIX Symposium on …, 2024 - usenix.org
At the heart of state machine replication, the celebrated technique enabling decentralized
and secure universal computation, lies Atomic Broadcast, a fundamental communication …

{SwiftPaxos}: Fast {Geo-Replicated} State Machines

F Ryabinin, A Gotsman, P Sutra - 21st USENIX Symposium on …, 2024 - usenix.org
Cloud services improve their availability by replicating data across sites in different
geographical regions. A variety of state-machine replication protocols have been proposed …

Bandle: Asynchronous State Machine Replication Made Efficient

B Wang, S Liu, H Dong, X Wang, W Xu… - Proceedings of the …, 2024 - dl.acm.org
State machine replication (SMR) uses consensus as its core component for reaching
agreement among a group of processes, in order to provide fault-tolerant services. Most …

Hydra:{Serialization-Free} Network Ordering for Strongly Consistent Distributed Applications

I Choi, E Michael, Y Li, DRK Ports, J Li - 20th USENIX Symposium on …, 2023 - usenix.org
Many distributed systems, eg, state machine replication and distributed databases, rely on
establishing a consistent order of operations on groups of nodes in the system. Traditionally …

SWARM: Replicating Shared Disaggregated-Memory Data in No Time

A Murat, C Burgelin, A Xygkis, I Zablotchi… - Proceedings of the …, 2024 - dl.acm.org
Memory disaggregation is an emerging data center architecture that improves resource
utilization and scalability. Replication is key to ensure the fault tolerance of applications, but …

Primcast: a latency-efficient atomic multicast

L Pacheco, P Coelho, F Pedone - Proceedings of the 24th International …, 2023 - dl.acm.org
Atomic multicast is a communication abstraction that allows for messages to be addressed to
and reliably delivered by multiple process groups, while ensuring a partial order on …

Regular sequential serializability and regular sequential consistency

J Helt, M Burke, A Levy, W Lloyd - Proceedings of the ACM SIGOPS 28th …, 2021 - dl.acm.org
Strictly serializable (linearizable) services appear to execute transactions (operations)
sequentially, in an order consistent with real time. This restricts a transaction's (operation's) …

Targeting Tail Latency in Replicated Systems with Proactive Rejection

L Lawniczak, T Distler - … of the 25th International Middleware Conference, 2024 - dl.acm.org
When put under stress, traditional state-machine replication protocols typically exhibit
response times that by far exceed the average level of normal-case operation. The common …

Racos: Improving Erasure Coding State Machine Replication using Leaderless Consensus

J Zarnstorff, L Lebow, C Siems, D Remuck… - Proceedings of the …, 2024 - dl.acm.org
Cloud storage systems often adopt state machine replication (SMR) to ensure reliability and
availability. Most SMR systems use" full-copy" replication across all nodes, which leads to …