A task scheduling algorithm based on classification mining in fog computing environment

L Liu, D Qi, N Zhou, Y Wu - Wireless Communications and …, 2018 - Wiley Online Library
Fog computing (FC) is an emerging paradigm that extends computation, communication,
and storage facilities towards the edge of a network. In this heterogeneous and distributed …

Astra-sim2.0: Modeling hierarchical networks and disaggregated systems for large-model training at scale

W Won, T Heo, S Rashidi, S Sridharan… - … Analysis of Systems …, 2023 - ieeexplore.ieee.org
As deep learning models and input data continue to scale at an unprecedented rate, it has
become inevitable to move towards distributed training platforms to fit the models and …

Astra-sim: Enabling sw/hw co-design exploration for distributed dl training platforms

S Rashidi, S Sridharan, S Srinivasan… - … Analysis of Systems …, 2020 - ieeexplore.ieee.org
Modern Deep Learning systems heavily rely on distributed training over high-performance
accelerator (e.g., TPU, GPU)-based hardware platforms. Examples today include Google's …

Performance prediction of parallel applications: a systematic literature review

J Flores-Contreras, HA Duran-Limon… - The Journal of …, 2021 - Springer
Different techniques for estimating the execution time of parallel applications have been
studied for the last 25 years. These approaches have proposed different methods for …

Predicting the energy-consumption of MPI applications at scale using only a single node

FC Heinrich, T Cornebize, A Degomme… - … on cluster computing …, 2017 - ieeexplore.ieee.org
Monitoring and assessing the energy efficiency of supercomputers and data centers is
crucial in order to limit and reduce their energy consumption. Applications from the domain …

Negative perceptions about the applicability of source-to-source compilers in hpc: A literature review

R Milewicz, P Pirkelbauer, P Soundararajan… - … Computing: ISC High …, 2021 - Springer
A source-to-source compiler is a type of translator that accepts the source code of a program
written in a programming language as its input and produces an equivalent source code in …

Optically connected memory for disaggregated data centers

J Gonzalez, MG Palma, M Hattink… - Journal of Parallel and …, 2022 - Elsevier
Recent advances in integrated photonics enable the implementation of reconfigurable,
high-bandwidth, and low energy-per-bit interconnects in next-generation data centers. We …

Automated calibration of parallel and distributed computing simulators: A case study

J McDonald, M Horzela, F Suter… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Many parallel and distributed computing research results are obtained in simulation, using
simulators that mimic real-world executions on some target system. Each such simulator is …

Compiler-assisted source-to-source skeletonization of application models for system simulation

JJ Wilke, JP Kenny, S Knight, S Rumley - High Performance Computing …, 2018 - Springer
Performance modeling of networks through simulation requires application endpoint models
that inject traffic into the simulation models. Endpoint models today for system-scale studies …

LLAMP: Assessing Network Latency Tolerance of HPC Applications with Linear Programming

S Shen, L Huang, M Chrapek… - … Conference for High …, 2024 - ieeexplore.ieee.org
The shift towards high-bandwidth networks driven by AI workloads in data centers and HPC
clusters has unintentionally aggravated network latency, adversely affecting the …