A Review on efficient thermal management of air-and liquid-cooled data centers: From chip to the cooling system

AH Khalaj, SK Halgamuge - Applied energy, 2017 - Elsevier
The growing global demand for services offered by data centers (DCs) has increased their
total power consumption and carbon emissions. Recent figures revealed that DCs account …

Towards joint optimization over ICT and cooling systems in data centre: A survey

W Zhang, Y Wen, YW Wong, KC Toh… - … Surveys & Tutorials, 2016 - ieeexplore.ieee.org
Effective management of ICT (information and communications technology) and cooling is
critical in modern data centres for high energy efficiency. This survey paper gives an …

Parallel programming with migratable objects: Charm++ in practice

B Acun, A Gupta, N Jain, A Langer… - SC'14: Proceedings …, 2014 - ieeexplore.ieee.org
The advent of petascale computing has introduced new challenges (eg Heterogeneity,
system failure) for programming scalable parallel applications. Increased complexity and …

Not all gpus are created equal: characterizing variability in large-scale, accelerator-rich systems

P Sinha, A Guliani, R Jain, B Tran… - … Conference for High …, 2022 - ieeexplore.ieee.org
Scientists are increasingly exploring and utilizing the massive parallelism of general-
purpose accelerators such as GPUs for scientific breakthroughs. As a result, datacenters …

Variation among processors under turbo boost in hpc systems

B Acun, P Miller, LV Kale - … of the 2016 International Conference on …, 2016 - dl.acm.org
The design and manufacture of present-day CPUs causes inherent variation in
supercomputer architectures such as variation in power and temperature of the chips. The …

Toward physics-guided safe deep reinforcement learning for green data center cooling control

R Wang, X Zhang, X Zhou, Y Wen… - 2022 ACM/IEEE 13th …, 2022 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) has shown good performance in tackling Markov
decision process (MDP) problems. As DRL opti-mizes a long-term reward, it is a promising …

Fine-grained energy efficiency using per-core dvfs with an adaptive runtime system

B Acun, K Chandrasekar… - 2019 Tenth International …, 2019 - ieeexplore.ieee.org
Dynamic voltage and frequency scaling (DVFS) is a well-known technique to reduce the
power and/or energy consumption of various applications. While most processors provide …

Green data center cooling control via physics-guided safe reinforcement learning

R Wang, Z Cao, X Zhou, Y Wen, R Tan - ACM Transactions on Cyber …, 2024 - dl.acm.org
Deep reinforcement learning (DRL) has shown good performance in tackling Markov
decision process (MDP) problems. As DRL optimizes a long-term reward, it is a promising …

PAL: A Variability-Aware Policy for Scheduling ML Workloads in GPU Clusters

R Jain, B Tran, K Chen, MD Sinclair… - … Conference for High …, 2024 - ieeexplore.ieee.org
Large-scale computing systems are increasingly using accelerators such as GPUs to enable
peta-and exa-scale levels of compute to meet the needs of Machine Learning (ML) and …

[Књига][B] Programming models for parallel computing

P Balaji - 2015 - books.google.com
An overview of the most prominent contemporary parallel processing programming models,
written in a unique tutorial style. With the coming of the parallel computing era, computer …