AI-coupled HPC workflow applications, middleware and performance

W Brewer, A Gainaru, F Suter, F Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
AI integration is revolutionizing the landscape of HPC simulations, enhancing the
importance, use, and performance of AI-driven HPC workflows. This paper surveys the …

Optimizing inference serving on serverless platforms

A Ali, R Pinciroli, F Yan, E Smirni - Proceedings of the VLDB Endowment, 2022 - par.nsf.gov
Serverless computing is gaining popularity for machine learning (ML) serving workload due
to its autonomous resource scaling, easy to use and pay-per-use cost model. Existing …

Linking scientific instruments and computation: Patterns, technologies, and experiences

R Vescovi, R Chard, ND Saint, B Blaiszik, J Pruyne… - Patterns, 2022 - cell.com
Powerful detectors at modern experimental facilities routinely collect data at multiple GB/s.
Online analysis methods are needed to enable the collection of only interesting subsets of …

Globus automation services: Research process automation across the space–time continuum

R Chard, J Pruyne, K McKee, J Bryan… - Future Generation …, 2023 - Elsevier
Research process automation–the reliable, efficient, and reproducible execution of linked
sets of actions on scientific instruments, computers, data stores, and other resources–has …

FAIR principles for AI models with a practical application for accelerated high energy diffraction microscopy

N Ravi, P Chaturvedi, EA Huerta, Z Liu, R Chard… - Scientific Data, 2022 - nature.com
A concise and measurable set of FAIR (Findable, Accessible, Interoperable and Reusable)
principles for scientific data is transforming the state-of-practice for data management and …

Fast and accurate learned multiresolution dynamical downscaling for precipitation

J Wang, Z Liu, I Foster, W Chang… - Geoscientific Model …, 2021 - gmd.copernicus.org
This study develops a neural network-based approach for emulating high-resolution
modeled precipitation data with comparable statistical properties but at greatly reduced …

Frontiers in scientific workflows: Pervasive integration with high-performance computing

RF Da Silva, RM Badia, D Bard, IT Foster, S Jha… - Computer, 2024 - ieeexplore.ieee.org
We address the increasing complexity of scientific workflows in the context of high-
performance computing (HPC) and their associated need for robust, adaptable, and flexible …

A comprehensive evaluation of novel AI accelerators for deep learning workloads

M Emani, Z **e, S Raskar, V Sastry… - 2022 IEEE/ACM …, 2022 - ieeexplore.ieee.org
Scientific applications are increasingly adopting Artificial Intelligence (AI) techniques to
advance science. High-performance computing centers are evaluating emerging novel …

Smlt: A serverless framework for scalable and adaptive machine learning design and training

A Ali, S Zawad, P Aditya, IE Akkus, R Chen… - arxiv preprint arxiv …, 2022 - arxiv.org
In today's production machine learning (ML) systems, models are continuously trained,
improved, and deployed. ML design and training are becoming a continuous workflow of …

Streaming data from experimental facilities to supercomputers for real-time data processing

S Veseli, J Hammonds, S Henke, H Parraga… - Proceedings of the SC' …, 2023 - dl.acm.org
In this paper we demonstrate direct data streaming from instruments and detectors at a large-
scale experimental facility to a supercomputer for real-time data processing and feedback …