A survey of fault-tolerance techniques for embedded systems from the perspective of power, energy, and thermal issues

S Safari, M Ansari, H Khdr, P Gohari-Nazari… - IEEE …, 2022 - ieeexplore.ieee.org
The relentless technology scaling has provided a significant increase in processor
performance, but on the other hand, it has led to adverse impacts on system reliability. In …

FLASH: Fast model adaptation in ML-centric cloud platforms

H Qiu, W Mao, A Patke, S Cui, C Wang… - Proceedings of …, 2024 - proceedings.mlsys.org
The emergence of ML in various cloud system management tasks (eg, workload autoscaling
and job scheduling) has become a core driver of ML-centric cloud platforms. However, there …

TherMa-MiCs: Thermal-aware scheduling for fault-tolerant mixed-criticality systems

S Safari, H Khdr, P Gohari-Nazari… - … on Parallel and …, 2021 - ieeexplore.ieee.org
Multicore platforms are becoming the dominant trend in designing Mixed-Criticality Systems
(MCSs), which integrate applications of different levels of criticality into the same platform. A …

Dvfo: Learning-based dvfs for energy-efficient edge-cloud collaborative inference

Z Zhang, Y Zhao, H Li, C Lin… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Due to limited resources on edge and different characteristics of deep neural network (DNN)
models, it is a big challenge to optimize DNN inference performance in terms of energy …

Thermal-aware standby-sparing technique on heterogeneous real-time embedded systems

M Ansari, S Safari, S Yari-Karin… - … on Emerging Topics …, 2021 - ieeexplore.ieee.org
Low power consumption, real-time computing, and high reliability are three key
requirements/design objectives of real-time embedded systems. The standby-sparing …

Passive primary/backup-based scheduling for simultaneous power and reliability management on heterogeneous embedded systems

S Yari-Karin, R Siyadatzadeh… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
In addition to meeting the real-time constraint, power/energy efficiency and high reliability
are two vital objectives for real-time embedded systems. Recently, heterogeneous multicore …

A multi-agent reinforcement learning-based method for server energy efficiency optimization combining DVFS and dynamic fan control

W Lin, W Lin, J Lin, H Zhong, J Wang, L He - … Computing: Informatics and …, 2024 - Elsevier
With the rapid development of the digital economy and intelligent industry, the energy
consumption of data centers (DCs) has increased significantly. Various optimization …

A predictive energy consumption scheduling algorithm for multiprocessor heterogeneous system

S Tian, W Ren, Q Deng, S Zou… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
With the increasing number of users and application in the era of the Industrial Internet of
Things (I-IoT), computing efficiency and energy consumption become two vital problems …

Energy-efficient computation with dvfs using deep reinforcement learning for multi-task systems in edge computing

X Li, T Zhou, H Wang, M Lin - arxiv preprint arxiv:2409.19434, 2024 - arxiv.org
Periodic soft real-time systems have broad applications in many areas, such as IoT. Finding
an optimal energy-efficient policy that is adaptable to underlying edge devices while …

RAVEN: reinforcement learning for generating verifiable run-time requirement enforcers for MPSoCs

K Esper, J Spieck, PL Sixdenier… - Fourth Workshop on …, 2023 - drops.dagstuhl.de
In embedded systems, applications frequently have to meet non-functional requirements
regarding, eg, real-time or energy consumption constraints, when executing on a given …