Not all gpus are created equal: characterizing variability in large-scale, accelerator-rich systems

P Sinha, A Guliani, R Jain, B Tran… - … Conference for High …, 2022 - ieeexplore.ieee.org
Scientists are increasingly exploring and utilizing the massive parallelism of general-
purpose accelerators such as GPUs for scientific breakthroughs. As a result, datacenters …

Evaluation of power management control on the supercomputer fugaku

Y Kodama, T Odajima, E Arima… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
The supercomputer “Fugaku”, which recently ranked number one on multiple
supercomputing lists, including the Top500 in June 2020, has various power control …

Energy hardware and workload aware job scheduling towards interconnected HPC environments

M D'Amico, JC Gonzalez - IEEE Transactions on Parallel and …, 2021 - ieeexplore.ieee.org
New HPC machines are getting close to the exascale. Power consumption for those
machines has been increasing, and researchers are studying ways to reduce it. A second …

Td-nuca: runtime driven management of nuca caches in task dataflow programming models

P Caheny, L Alvarez, M Casas… - … Conference for High …, 2022 - ieeexplore.ieee.org
In high performance processors, the design of on-chip memory hierarchies is crucial for
performance and energy efficiency. Current processors rely on large shared Non-Uniform …

Toward Sustainable HPC: In-Production Deployment of Incentive-Based Power Efficiency Mechanism on the Fugaku Supercomputer

ALV Solórzano, K Sato, K Yamamoto… - … Conference for High …, 2024 - ieeexplore.ieee.org
This paper describes the deployment and operational experience of a novel incentive-based
power-control strategy on the Fugaku supercomputer. Our incentive-based program, termed …

A data center demand response policy for real-world workload scenarios in HPC

Y Zhang, DC Wilson, IC Paschalidis… - … Design, Automation & …, 2021 - ieeexplore.ieee.org
Demand response programs offer an opportunity for large power consumers to save on
electricity costs by modulating their power consumption in response to demand changes in …

sys-sage: A Unified Representation of Dynamic Topologies & Attributes on HPC Systems

S Vanecek, M Schulz - Proceedings of the 38th ACM International …, 2024 - dl.acm.org
HPC systems are getting ever more powerful, but this comes at the price of increasing
system complexity: node architectures are deeply hierarchical and in many cases …

Light-weight prediction for improving energy consumption in HPC platforms

D Carastan-Santos, G Da Costa, M Poquet… - … Conference on Parallel …, 2024 - Springer
With the increase of demand for computing resources and the struggle to provide the
necessary energy, power-aware resource management is becoming a major issue for the …

Stereo: Assignment and scheduling in MPSoC under process variation by combining stochastic and decomposition approaches

B Khodabandeloo, A Khonsari… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Aggressive scaling in integrated circuits creates new challenges such as an increase in
power density, temperature, and especially process variation in designing Multiprocessor …

Analyzing performance and power-efficiency variations among nvidia gpus

K Yoshida, R Sageyama, S Miwa, H Yamaki… - Proceedings of the 51st …, 2022 - dl.acm.org
Understanding the variations in performance and power-efficiency of compute nodes is
important for enhancing these factors in modern supercomputing systems. Previous studies …