A survey of machine learning for computer architecture and systems

N Wu, Y **e - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …

A survey of machine learning applied to computer architecture design

DD Penney, L Chen - arxiv preprint arxiv:1909.12373, 2019 - arxiv.org
Machine learning has enabled significant benefits in diverse fields, but, with a few
exceptions, has had limited impact on computer architecture. Recent work, however, has …

zTT: Learning-based DVFS with zero thermal throttling for mobile devices

S Kim, K Bin, S Ha, K Lee, S Chong - GetMobile: Mobile Computing and …, 2022 - dl.acm.org
With the advent of mobile processors integrating CPU and GPU, high-performance tasks,
such as deep learning, gaming, and image processing are running on mobile devices. To …

Designing adaptive neural networks for energy-constrained image classification

D Stamoulis, TWR Chin, AK Prakash… - 2018 IEEE/ACM …, 2018 - ieeexplore.ieee.org
As convolutional neural networks (CNNs) enable state-of-the-art computer vision
applications, their high energy consumption has emerged as a key impediment to their …

Hardware-aware machine learning: Modeling and optimization

D Marculescu, D Stamoulis… - 2018 IEEE/ACM …, 2018 - ieeexplore.ieee.org
Recent breakthroughs in Machine Learning (ML) applications, and especially in Deep
Learning (DL), have made DL models a key component in almost every modern computing …

Power-aware checkpointing for multicore embedded systems

M Ansari, S Safari, H Khdr… - … on Parallel and …, 2022 - ieeexplore.ieee.org
Increasing the number of cores integrated on a single chip offers a great potential for the
implementation of fault-tolerant techniques to achieve high reliability in real-time embedded …

NPU-accelerated imitation learning for thermal optimization of QoS-constrained heterogeneous multi-cores

M Rapp, H Khdr, N Krohmer, J Henkel - ACM Transactions on Design …, 2023 - dl.acm.org
Thermal optimization of a heterogeneous clustered multi-core processor under user-defined
QoS targets requires application migration and DVFS. However, selecting the core to …

F-LEMMA: Fast learning-based energy management for multi-/many-core processors

A Zou, K Garimella, B Lee, C Gill, X Zhang - Proceedings of the 2020 …, 2020 - dl.acm.org
Over the last two decades, as microprocessors have evolved to achieve higher
computational performance, their power density also has increased at an accelerated rate …

[PDF][PDF] Mapformer: Attention-based multidnn manager for throughout & power co-optimization on embedded devices

A Karatzas, I Anagnostopoulos - Power, 2024 - engr.siu.edu
In the context of modern services that use multiple Deep Neural Networks (DNNs),
managing workloads on embedded devices presents unique challenges. These devices …

Multi-Agent Reinforcement Learning for Thermally-Restricted Performance Optimization on Manycores

H Khdr, ME Batur, K Zhou, MB Sikal… - … Design, Automation & …, 2024 - ieeexplore.ieee.org
The problem of performance maximization under a thermal constraint has been tackled by
means of dynamic voltage and frequency scaling (DVFS) in many system-level optimization …