Toward a smart cloud: A review of fault-tolerance methods in cloud systems

MA Mukwevho, T Celik - IEEE Transactions on Services …, 2018 - ieeexplore.ieee.org
This paper presents a comprehensive survey of the state-of-the-art work on fault tolerance
methods proposed for cloud computing. The survey classifies fault-tolerance methods into …

GPU devices for safety-critical systems: A survey

J Perez-Cerrolaza, J Abella, L Kosmidis… - ACM Computing …, 2022 - dl.acm.org
Graphics Processing Unit (GPU) devices and their associated software programming
languages and frameworks can deliver the computing performance required to facilitate the …

T-Storm: Traffic-aware online scheduling in Storm

J Xu, Z Chen, J Tang, S Su - 2014 IEEE 34th International …, 2014 - ieeexplore.ieee.org
Storm has emerged as a promising computation platform for stream data processing. In this
paper, we first show inefficiencies of the current practice of Storm scheduling and challenges …

Mutations: How close are they to real faults?

R Gopinath, C Jensen, A Groce - 2014 IEEE 25th International …, 2014 - ieeexplore.ieee.org
Mutation analysis is often used to compare the effectiveness of different test suites or testing
techniques. One of the main assumptions underlying this technique is the Competent …

FAIL*: An open and versatile fault-injection framework for the assessment of software-implemented hardware fault tolerance

H Schirmeier, M Hoffmann, C Dietrich… - 2015 11th European …, 2015 - ieeexplore.ieee.org
Due to voltage and structure shrinking, the influence of radiation on a circuit's operation
increases, resulting in future hardware designs exhibiting much higher rates of soft errors …

[BOOK][B] Performance, reliability, and availability evaluation of computational systems, volume I: performance and background

PRM Maciel - 2023 - taylorfrancis.com
This textbook intends to be a comprehensive and substantially self-contained two-volume
book covering performance, reliability, and availability evaluation subjects. The volumes …

Hardware-in-the-loop fault injection for traction control system

X Yang, C Yang, T Peng, Z Chen… - IEEE Journal of …, 2018 - ieeexplore.ieee.org
This paper presents a multiprocessor hardware-in-the-loop (HIL) fault injection strategy for
real-time simulation of faults in traction control system (TCS). TCS models are solved for the …

Avoiding pitfalls in fault-injection based comparison of program susceptibility to soft errors

H Schirmeier, C Borchert… - 2015 45th Annual IEEE …, 2015 - ieeexplore.ieee.org
Since the first identification of physical causes for soft errors in memory circuits, fault
injection (FI) has grown into a standard methodology to assess the fault resilience of …

Exploring fault parameter space using reinforcement learning-based fault injection

M Moradi, BJ Oakes, M Saraoglu… - 2020 50th Annual …, 2020 - ieeexplore.ieee.org
Assessing the safety of complex Cyber-Physical Systems (CPS) is a challenge in any
industry. Fault Injection (FI) is a proven technique for safety analysis and is recommended by …

[HTML] Hardware-in-the-loop-based real-time fault injection framework for dynamic behavior analysis of automotive software systems

M Abboush, D Bamal, C Knieke, A Rausch - Sensors, 2022 - mdpi.com
A well-known challenge in the development of safety-critical systems in vehicles today is that
reliability and safety assessment should be rigorously addressed and monitored. As a matter …