Parallel computing hits the power wall: principles, challenges, and a survey of solutions
AF Lorenzon, ACS Beck Filho - 2019 - books.google.com
This book describes several approaches to adaptability that are applied for the optimization
of parallel applications, such as thread-level parallelism exploitation and dynamic voltage …
Performance evaluation of Intel Optane memory for managed workloads
S Akram - ACM Transactions on Architecture and Code …, 2021 - dl.acm.org
Intel Optane memory offers non-volatility, byte addressability, and high capacity. It suits
managed workloads that prefer large main memory heaps. We investigate Optane as the …
Asymmetry-aware scalable locking
The pursuit of power-efficiency is popularizing asymmetric multicore processors (AMP) such
as ARM big.LITTLE, Apple M1 and recent Intel Alder Lake with big and little cores. However …
Model-based optimization of the energy efficiency of multi-threaded applications
T Rauber, G Rünger, M Stachowski - Sustainable Computing: Informatics …, 2019 - Elsevier
Energy efficiency is considered to be a critical concern for modern hardware and a variety of
hardware features have been developed to improve the energy balance for executing …
DEP+BURST: Online DVFS performance prediction for energy-efficient managed language execution
Making modern computer systems energy-efficient is of paramount importance. Dynamic
Voltage and Frequency Scaling (DVFS) is widely used to manage the energy and power …
RPPM: Rapid performance prediction of multithreaded workloads on multicore processors
Analytical performance modeling is a useful complement to detailed cycle-level simulation to
quickly explore the design space in an early design stage. Mechanistic analytical modeling …
Pac-Sim: Simulation of Multi-threaded Workloads using Intelligent, Live Sampling
High-performance, multi-core processors are the key to accelerating workloads in several
application domains. To continue to scale performance at the limit of Moore's Law and …
DVFS virtualization for energy minimization of mixed-criticality dual-OS platforms
T Komori, Y Masuda, T Ishihara - 2022 IEEE 28th International …, 2022 - ieeexplore.ieee.org
A dual-OS platform can efficiently implement emerging mixed-criticality systems by
consolidating a real-time OS (RTOS) and a general-purpose OS (GPOS). Although the dual …
COS: A parallel performance model for dynamic variations in processor speed, memory speed, and thread concurrency
Highly-parallel, high-performance scientific applications must maximize performance inside
of a power envelope while maintaining scalability. Emergent parallel and distributed …
Maximizing heterogeneous processor performance under power constraints
Heterogeneous processors (e.g., ARM's big.LITTLE) improve performance in power-constrained
environments by executing applications on the 'little' low-power core and move …