Frame: Fault tolerant and real-time messaging for edge computing

C Wang, C Gill, C Lu - 2019 IEEE 39th International …, 2019 - ieeexplore.ieee.org
Edge computing systems for Industrial Internet of Things (IIoT) applications require reliable
and timely message delivery. Both latency discrepancies within edge clouds, and …

Recovery algorithms for paxos-based state machine replication

J Kończak, PT Wojciechowski, N Santos… - … on Dependable and …, 2019 - ieeexplore.ieee.org
In this article, we propose and evaluate three different state recovery algorithms aimed for
Paxos-one of the most popular distributed agreement protocols. Paxos is commonly used to …

Fast replica recovery and adaptive consistency preservation for edge cloud system

J Guo, C Li, Y Luo - Soft Computing, 2020 - Springer
Edge cloud extends the power of cloud computing to the edge of the devices that are closest
to the demands of big connection, low latency and large bandwidth. However, there are still …

Building global and scalable systems with atomic multicast

S Benz, PJ Marandi, F Pedone… - Proceedings of the 15th …, 2014 - dl.acm.org
The rise of worldwide Internet-scale services demands large distributed systems. Indeed,
when handling several millions of users, it is common to operate thousands of servers …

Total Execution Order in Fault-Tolerant Real-Time Systems

A Naghavi, N Navet - Proceedings of the 32nd International Conference …, 2024 - dl.acm.org
Many real-time systems nowadays must not only tolerate accidental faults but also targeted
attacks. Typically, techniques such as replication and diversification are used to mask the …

Reducing Persistence Overhead in Parallel State Machine Replication through Time-Phased Partitioned Checkpoint

E Gomes Jr, E Alchieri, F Dotti… - Journal of Internet …, 2024 - journals-sol.sbc.org.br
Dependable systems usually rely on replication to provide resilience and availability.
However, for long-lived systems, replication is not enough since given a sufficient amount of …

Checkpointing techniques in distributed systems: A synopsis of diverse strategies over the last decades

H Goulart, A Franco, O Mendizabal - … do XXIV Workshop de Testes e …, 2023 - sol.sbc.org.br
This paper concisely reviews checkpointing techniques in distributed systems, focusing on
various aspects such as coordinated and uncoordinated checkpointing, incremental …

Boosting state machine replication with concurrent execution

E Alchieri, F Dotti, P Marandi… - 2018 Eighth Latin …, 2018 - ieeexplore.ieee.org
State machine replication is a fundamental technique to render services fault tolerant. One of
the key assumptions of state machine replication is that replicas must execute operations …

Failure Recovery from Persistent Memory in Paxos-Based State Machine Replication

J Kończak, PT Wojciechowski - 2021 40th International …, 2021 - ieeexplore.ieee.org
Paxos is one of the most popular protocols for state machine replication (a technique used
for making services highly available). We are the first to propose a Paxos-based state …

Dynamic state partitioning in parallelized byzantine fault tolerance

B Li, W Xu, R Kapitza - 2018 48th Annual IEEE/IFIP …, 2018 - ieeexplore.ieee.org
Recent research works have shown that applying parallelization to request processing in
Byzantine Fault Tolerance (BFT) can bring significant performance improvement. Based on …