Smartly handling renewable energy instability in supporting a cloud datacenter

J Gao, H Wang, H Shen - 2020 IEEE international parallel and …, 2020 - ieeexplore.ieee.org
The size and energy consumption of datacenters have been increasing significantly over the
past years. As a result, datacenters' increasing electricity monetary cost, energy …

FT-CNN: Algorithm-based fault tolerance for convolutional neural networks

K Zhao, S Di, S Li, X Liang, Y Zhai… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) are becoming more and more important for solving
challenging and critical problems in many fields. CNN inference applications have been …

Artificial intelligence: An energy efficiency tool for enhanced high performance computing

AH Kelechi, MH Alsharif, OJ Bameyi, PJ Ezra… - Symmetry, 2020 - mdpi.com
Power-consuming entities such as high performance computing (HPC) sites and large data
centers are growing with the advance in information technology. In business, HPC is used to …

ePVF: An enhanced program vulnerability factor methodology for cross-layer resilience analysis

B Fang, Q Lu, K Pattabiraman… - 2016 46th Annual …, 2016 - ieeexplore.ieee.org
The Program Vulnerability Factor (PVF) has been proposed as a metric to understand the
impact of hardware faults on software. The PVF is calculated by identifying the program bits …

What does power consumption behavior of HPC jobs reveal?: Demystifying, quantifying, and predicting power consumption characteristics

T Patel, A Wagenhäuser, C Eibel… - 2020 IEEE …, 2020 - ieeexplore.ieee.org
As we approach exascale computing, large-scale HPC systems are becoming increasingly
power-constrained, requiring them to run HPC workloads in an energy-efficient manner. The …

GreenMM: energy efficient GPU matrix multiplication through undervolting

H Zamani, Y Liu, D Tripathy, L Bhuyan… - Proceedings of the ACM …, 2019 - dl.acm.org
The current trend of ever-increasing performance in scientific applications comes with
tremendous growth in energy consumption. In this paper, we present GreenMM framework …

An Overview of Digital Transformation and Environmental Sustainability: Threats, Opportunities, and Solutions

A Goel, S Masurkar, GR Pathade - Sustainability, 2024 - mdpi.com
Digital transformation, powered by technologies like AI, IoT, and big data, is reshaping
industries and societies at an unprecedented pace. While these innovations promise …

TSM2: optimizing tall-and-skinny matrix-matrix multiplication on GPUs

J Chen, N …

… With Deep Reinforcement Learning

Y Wang, M Hao, H He, W Zhang, Q Tang… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Power and energy consumption is the limiting factor of modern computing systems. As the
GPU becomes a mainstream computing device, power management for GPUs becomes …

New-sum: A novel online ABFT scheme for general iterative methods

D Tao, SL Song, S Krishnamoorthy, P Wu… - Proceedings of the 25th …, 2016 - dl.acm.org
Emerging high-performance computing platforms, with large component counts and lower
power margins, are anticipated to be more susceptible to soft errors in both logic circuits and …