Predictive performance modeling for distributed batch processing using black box monitoring and machine learning

C Witt, M Bux, W Gusew, U Leser - Information Systems, 2019 - Elsevier
In many domains, the previous decade was characterized by increasing data volumes and
growing complexity of data analyses, creating new demands for batch processing on …

A comprehensive tutorial on science DMZ

J Crichigno, E Bou-Harb… - … Communications Surveys & …, 2018 - ieeexplore.ieee.org
Science and engineering applications are now generating data at an unprecedented rate.
From large facilities such as the Large Hadron Collider to portable DNA sequencing …

Bridging data center AI systems with edge computing for actionable information retrieval

Z Liu, A Ali, P Kenesei, A Miceli… - 2021 3rd Annual …, 2021 - ieeexplore.ieee.org
Extremely high data rates at modern synchrotron and X-ray free-electron laser light source
beamlines motivate the use of machine learning methods for data reduction, feature …

Transferring a petabyte in a day

R Kettimuthu, Z Liu, D Wheeler, I Foster… - Future Generation …, 2018 - Elsevier
Extreme-scale simulations and experiments can generate large amounts of data, whose
volume can exceed the compute and/or storage capacity at the simulation or experimental …

Cross-geography scientific data transferring trends and behavior

Z Liu, R Kettimuthu, I Foster, NSV Rao - Proceedings of the 27th …, 2018 - dl.acm.org
Wide area data transfers play an important role in many science applications but rely on
expensive infrastructure that often delivers disappointing performance in practice. In …

Characterization and identification of HPC applications at leadership computing facility

Z Liu, R Lewis, R Kettimuthu, K Harms… - Proceedings of the 34th …, 2020 - dl.acm.org
High Performance Computing (HPC) is an important method for scientific discovery via
large-scale simulation, data analysis, or artificial intelligence. Leadership-class supercomputers …

Data transfer between scientific facilities–bottleneck analysis, insights and optimizations

Y Liu, Z Liu, R Kettimuthu, N Rao… - 2019 19th IEEE/ACM …, 2019 - ieeexplore.ieee.org
Wide area file transfers play an important role in many science applications. File transfer
tools typically deliver the highest performance for datasets with a small number of large files …

SciStream: Architecture and toolkit for data streaming between federated science instruments

J Chung, W Zacherek, AJ Wisniewski, Z Liu… - Proceedings of the 31st …, 2022 - dl.acm.org
Modern scientific instruments, such as detectors at synchrotron light sources, generate data
at such high rates that online processing is needed for data reduction, feature detection …

Globus service enhancements for exascale applications and facilities

W Zheng, J Kordas, TJ Skluzacek… - … Journal of High …, 2024 - journals.sagepub.com
Many extreme-scale applications require the movement of large quantities of data to, from,
and among leadership computing facilities, as well as other scientific facilities and the home …

The Modern Research Data Portal: a design pattern for networked, data-intensive science

K Chard, E Dart, I Foster, D Shifflett, S Tuecke… - PeerJ Computer …, 2018 - peerj.com
We describe best practices for providing convenient, high-speed, secure access to large
data via research data portals. We capture these best practices in a new design pattern, the …