RowPress: Amplifying read disturbance in modern DRAM chips

H Luo, A Olgun, AG Yağlıkçı, YC Tuğrul… - Proceedings of the 50th …, 2023 - dl.acm.org
Memory isolation is critical for system reliability, security, and safety. Unfortunately, read
disturbance can break memory isolation in modern DRAM chips. For example, RowHammer …

Artificial neural networks for space and safety-critical applications: Reliability issues and potential solutions

P Rech - IEEE Transactions on Nuclear Science, 2024 - ieeexplore.ieee.org
Machine learning is among the greatest advancements in computer science and
engineering and is today used to classify or detect objects, a key feature in autonomous …

An experimental analysis of RowHammer in HBM2 DRAM chips

A Olgun, M Osseiran, AG Yağlıkçı… - 2023 53rd Annual …, 2023 - ieeexplore.ieee.org
RowHammer (RH) is a significant and worsening security, safety, and reliability issue of
modern DRAM chips that can be exploited to break memory isolation. Therefore, it is …

A multi-level approach to evaluate the impact of GPU permanent faults on CNN's reliability

JER Condia, JD Guerrero-Balaguera… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Graphics processing units (GPUs) are widely used to accelerate Artificial Intelligence
applications, such as those based on Convolutional Neural Networks (CNNs). Since in …

Read disturbance in high bandwidth memory: A detailed experimental study on HBM2 DRAM chips

A Olgun, M Osseiran, AG Yağlıkçı… - 2024 54th Annual …, 2024 - ieeexplore.ieee.org
We experimentally demonstrate the effects of read disturbance (RowHammer and
RowPress) and uncover the inner workings of undocumented read disturbance defense …

Understanding the Effects of Permanent Faults in GPU's Parallelism Management and Control Units

JD Guerrero Balaguera, JE Rodriguez Condia… - Proceedings of the …, 2023 - dl.acm.org
Modern Graphics Processing Units (GPUs) demand life expectancy extended to many years,
exposing the hardware to aging (i.e., permanent faults arising after the end-of-manufacturing …

Structural coding: A low-cost scheme to protect CNNs from large-granularity memory faults

A Asgari Khoshouyeh, F Geissler, S Qutub… - Proceedings of the …, 2023 - dl.acm.org
The advent of High-Performance Computing has led to the adoption of Convolutional Neural
Networks (CNNs) in safety-critical applications such as autonomous vehicles. However …

Transient-fault-aware design and training to enhance DNNs reliability with zero-overhead

N Cavagnero, F Dos Santos, M Ciccone… - 2022 IEEE 28th …, 2022 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) enable a wide series of technological advancements,
ranging from clinical imaging, to predictive industrial maintenance and autonomous driving …

Cross-Layer Reliability Evaluation and Efficient Hardening of Large Vision Transformers Models

L Roquet, F Fernandes dos Santos, P Rech… - Proceedings of the 61st …, 2024 - dl.acm.org
Vision Transformers (ViTs) are highly accurate Machine Learning (ML) models. However,
their large size and complexity increase the expected error rate due to hardware faults …

Unity ECC: Unified Memory Protection Against Bit and Chip Errors

D Kim, J Lee, W Jung, M Sullivan, J Kim - Proceedings of the …, 2023 - dl.acm.org
DRAM vendors utilize On-Die Error Correction Codes (OD-ECC) to correct random bit errors
internally. Meanwhile, system companies utilize Rank-Level ECC (RL-ECC) to protect data …