[BOOK][B] Fault-tolerant message-passing distributed systems: an algorithmic approach

M Raynal - 2018 - books.google.com
This book presents the most important fault-tolerant distributed programming abstractions
and their associated distributed algorithms, in particular in terms of reliable communication …

The/spl phi/accrual failure detector

N Hayashibara, X Defago, R Yared… - Proceedings of the …, 2004 - ieeexplore.ieee.org
The detection of failures is a fundamental issue for fault-tolerance in distributed systems.
Recently, many people have come to realize that failure detection ought to be provided as …