An overview of the Singularity project

G Hunt, JR Larus, M Abadi, M Aiken, P Barham… - 2005 - academia.edu
Singularity is a research project in Microsoft Research that started with the question: what
would a software platform look like if it was designed from scratch with the primary goal of …

ZooKeeper: Wait-free coordination for internet-scale systems

P Hunt, M Konar, FP Junqueira, B Reed - 2010 USENIX Annual …, 2010 - usenix.org
In this paper, we describe ZooKeeper, a service for coordinating processes of distributed
applications. Since ZooKeeper is part of critical infrastructure, ZooKeeper aims to provide a …

There is more consensus in egalitarian parliaments

I Moraru, DG Andersen, M Kaminsky - Proceedings of the Twenty-Fourth …, 2013 - dl.acm.org
This paper describes the design and implementation of Egalitarian Paxos (EPaxos), a new
distributed consensus algorithm based on Paxos. EPaxos achieves three goals: (1) optimal …

Bigtable: A distributed storage system for structured data

F Chang, J Dean, S Ghemawat, WC Hsieh… - ACM Transactions on …, 2008 - dl.acm.org
Bigtable is a distributed storage system for managing structured data that is designed to
scale to a very large size: petabytes of data across thousands of commodity servers. Many …

The Chubby lock service for loosely-coupled distributed systems

M Burrows - Proceedings of the 7th symposium on Operating …, 2006 - usenix.org
We describe our experiences with the Chubby lock service, which is intended to provide
coarse-grained locking as well as reliable (though low-volume) storage for a loosely …

Finding a needle in haystack: Facebook's photo storage

D Beaver, S Kumar, HC Li, J Sobel… - 9th USENIX Symposium on …, 2010 - usenix.org
This paper describes Haystack, an object storage system optimized for Facebook's Photos
application. Facebook currently stores over 260 billion images, which translates to over 20 …

RBFT: Redundant Byzantine fault tolerance

PL Aublin, SB Mokhtar, V Quéma - 2013 IEEE 33rd …, 2013 - ieeexplore.ieee.org
Byzantine Fault Tolerant state machine replication (BFT) protocols are replication protocols
that tolerate arbitrary faults of a fraction of the replicas. Although significant efforts have been …

PNUTS: Yahoo!'s hosted data serving platform

BF Cooper, R Ramakrishnan, U Srivastava… - Proceedings of the …, 2008 - dl.acm.org
We describe PNUTS, a massively parallel and geographically distributed database system
for Yahoo!'s web applications. PNUTS provides data storage organized as hashed or …

Zab: High-performance broadcast for primary-backup systems

FP Junqueira, BC Reed… - 2011 IEEE/IFIP 41st …, 2011 - ieeexplore.ieee.org
Zab is a crash-recovery atomic broadcast algorithm we designed for the ZooKeeper
coordination service. ZooKeeper implements a primary-backup scheme in which a primary …

Paxos made live: an engineering perspective

TD Chandra, R Griesemer, J Redstone - … of the twenty-sixth annual ACM …, 2007 - dl.acm.org
We describe our experience in building a fault-tolerant data-base using the Paxos
consensus algorithm. Despite the existing literature in the field, building such a database …