Integrating quantum computing resources into scientific HPC ecosystems
Quantum Computing (QC) offers significant potential to enhance scientific discovery in fields
such as quantum chemistry, optimization, and artificial intelligence. Yet QC faces challenges …
A survey on checkpointing strategies: Should we always checkpoint à la Young/Daly?
Abstract The Young/Daly formula provides an approximation of the optimal checkpointing
period for a parallel application executing on a supercomputing platform. It was originally …
A digital twin framework for liquid-cooled supercomputers as demonstrated at exascale
We present ExaDigiT, an open-source framework for developing comprehensive digital
twins of liquid-cooled supercomputers. It integrates three main modules: (1) a resource …
GPU-enabled extreme-scale turbulence simulations: Fourier pseudo-spectral algorithms at the exascale using OpenMP offloading
Fourier pseudo-spectral methods for nonlinear partial differential equations are of wide
interest in many areas of advanced computational science, including direct numerical …
Distributed computing for physics-based data-driven reduced modeling at scale: Application to a rotating detonation rocket engine
IG Farcas, RP Gundevia, R Munipalli… - arXiv preprint arXiv …, 2024 - arxiv.org
High-performance computing (HPC) has revolutionized our ability to perform detailed
simulations of complex real-world processes. A prominent contemporary example is from …
Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers
Training and fine-tuning large language models (LLMs) with hundreds of billions to trillions
of parameters requires tens of thousands of GPUs, and a highly scalable software stack. In …
Bandwidth Characterization of DeepSpeed on Distributed Large Language Model Training
The exponential growth of the training dataset and the size of the large language model
(LLM) significantly outpaces the incremental memory capacity increase in the graphics pro …
A framework for integrating quantum simulation and high performance computing
Scientific applications are starting to explore the viability of quantum computing. This
exploration typically begins with quantum simulations that can run on existing classical …
Exploring GPU-to-GPU communication: Insights into supercomputer interconnects
Multi-GPU nodes are increasingly common in the rapidly evolving landscape of exascale
supercomputers. On these systems, GPUs on the same node are connected through …
Whispering gallery mode sensing through the lens of quantum optics, artificial intelligence, and nanoscale catalysis
Ultra-sensitive sensors based on the resonant properties of whispering gallery modes
(WGMs) can detect fractional changes in nanoscale environments down to the length and …