Testability and dependability of AI hardware: Survey, trends, challenges, and perspectives

F Su, C Liu, HG Stratigopoulos - IEEE Design & Test, 2023 - ieeexplore.ieee.org
Hardware realization of artificial intelligence (AI) requires new design styles and even
underlying technologies than those used in traditional digital processors or logic circuits …

A systematic literature review on hardware reliability assessment methods for deep neural networks

MH Ahmadilivani, M Taheri, J Raik… - ACM Computing …, 2024 - dl.acm.org
Artificial Intelligence (AI) and, in particular, Machine Learning (ML), have emerged to be
utilized in various applications due to their capability to learn how to solve complex …

Ares: A framework for quantifying the resilience of deep neural networks

B Reagen, U Gupta, L Pentecost… - Proceedings of the 55th …, 2018 - dl.acm.org
As the use of deep neural networks continues to grow, so does the fraction of compute
cycles devoted to their execution. This has led the CAD and architecture communities to …

Nvbit: A dynamic binary instrumentation framework for nvidia gpus

O Villa, M Stephenson, D Nellans… - Proceedings of the 52nd …, 2019 - dl.acm.org
Binary instrumentation frameworks are widely used to implement profilers, performance
evaluation, error checking, and bug detection tools. While dynamic binary instrumentation …

Analyzing and increasing the reliability of convolutional neural networks on GPUs

FF dos Santos, PF Pimenta, C Lunardi… - IEEE Transactions …, 2018 - ieeexplore.ieee.org
Graphics processing units (GPUs) are playing a critical role in convolutional neural networks
(CNNs) for image detection. As GPU-enabled CNNs move into safety-critical environments …

A survey on modeling and improving reliability of DNN algorithms and accelerators

S Mittal - Journal of Systems Architecture, 2020 - Elsevier
As DNNs become increasingly common in mission-critical applications, ensuring their
reliable operation has become crucial. Conventional resilience techniques fail to account for …

Ml-based fault injection for autonomous vehicles: A case for bayesian fault injection

S Jha, S Banerjee, T Tsai, SKS Hari… - 2019 49th annual …, 2019 - ieeexplore.ieee.org
The safety and resilience of fully autonomous vehicles (AVs) are of significant concern, as
exemplified by several headline-making accidents. While AV development today involves …

Pytorchfi: A runtime perturbation tool for dnns

A Mahmoud, N Aggarwal, A Nobbe… - 2020 50th Annual …, 2020 - ieeexplore.ieee.org
PyTorchFI is a runtime perturbation tool for deep neural networks (DNNs), implemented for
the popular PyTorch deep learning platform. PyTorchFI enables users to perform …

Nvbitfi: Dynamic fault injection for gpus

T Tsai, SKS Hari, M Sullivan, O Villa… - 2021 51st Annual …, 2021 - ieeexplore.ieee.org
GPUs have found wide acceptance in domains such as high-performance computing and
autonomous vehicles, which require fast processing of large amounts of data along with …

Artificial neural networks for space and safety-critical applications: Reliability issues and potential solutions

P Rech - IEEE Transactions on Nuclear Science, 2024 - ieeexplore.ieee.org
Machine learning is among the greatest advancements in computer science and
engineering and is today used to classify or detect objects, a key feature in autonomous …