[BOOK][B] Parallel computing hits the power wall: principles, challenges, and a survey of solutions

AF Lorenzon, ACS Beck Filho - 2019 - books.google.com
This book describes several approaches to adaptability that are applied for the optimization
of parallel applications, such as thread-level parallelism exploitation and dynamic voltage …

Performance evaluation of intel optane memory for managed workloads

S Akram - ACM Transactions on Architecture and Code …, 2021 - dl.acm.org
Intel Optane memory offers non-volatility, byte addressability, and high capacity. It suits
managed workloads that prefer large main memory heaps. We investigate Optane as the …

Asymmetry-aware scalable locking

N Liu, J Gu, D Tang, K Li, B Zang, H Chen - Proceedings of the 27th ACM …, 2022 - dl.acm.org
The pursuit of power-efficiency is popularizing asymmetric multicore processors (AMP) such
as ARM big. LITTLE, Apple M1 and recent Intel Alder Lake with big and little cores. However …

Model-based optimization of the energy efficiency of multi-threaded applications

T Rauber, G Rünger, M Stachowski - Sustainable Computing: Informatics …, 2019 - Elsevier
Energy efficiency is considered to be a critical concern for modern hardware and a variety of
hardware features have been developed to improve the energy balance for executing …

DEP+ BURST: Online DVFS performance prediction for energy-efficient managed language execution

S Akram, JB Sartor, L Eeckhout - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Making modern computer systems energy-efficient is of paramount importance. Dynamic
Voltage and Frequency Scaling (DVFS) is widely used to manage the energy and power …

RPPM: Rapid performance prediction of multithreaded workloads on multicore processors

S De Pestel, S Van den Steen, S Akram… - … Analysis of Systems …, 2019 - ieeexplore.ieee.org
Analytical performance modeling is a useful complement to detailed cycle-level simulation to
quickly explore the design space in an early design stage. Mechanistic analytical modeling …

Pac-Sim: Simulation of Multi-threaded Workloads using Intelligent, Live Sampling

C Liu, A Sabu, A Chaudhari, Q Kang… - ACM Transactions on …, 2024 - dl.acm.org
High-performance, multi-core processors are the key to accelerating workloads in several
application domains. To continue to scale performance at the limit of Moore's Law and …

DVFS virtualization for energy minimization of mixed-criticality dual-OS platforms

T Komori, Y Masuda, T Ishihara - 2022 IEEE 28th International …, 2022 - ieeexplore.ieee.org
A dual-OS platform can efficiently implement emerging mixed-criticality systems by
consolidating a real-time OS (RTOS) and a general-purpose OS (GPOS). Although the dual …

COS: A parallel performance model for dynamic variations in processor speed, memory speed, and thread concurrency

B Li, EA León, KW Cameron - … of the 26th International Symposium on …, 2017 - dl.acm.org
Highly-parallel, high-performance scientific applications must maximize performance inside
of a power envelope while maintaining scalability. Emergent parallel and distributed …

Maximizing heterogeneous processor performance under power constraints

A Adileh, S Eyerman, A Jaleel, L Eeckhout - ACM Transactions on …, 2016 - dl.acm.org
Heterogeneous processors (eg, ARM's big. LITTLE) improve performance in power-
constrained environments by executing applications on the 'little'low-power core and move …