AI-coupled HPC workflow applications, middleware and performance

W Brewer, A Gainaru, F Suter, F Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
AI integration is revolutionizing the landscape of HPC simulations, enhancing the
importance, use, and performance of AI-driven HPC workflows. This paper surveys the …

Xrbench: An extended reality (xr) machine learning benchmark suite for the metaverse

H Kwon, K Nair, J Seo, J Yik… - Proceedings of …, 2023 - proceedings.mlsys.org
Real-time multi-task multi-model (MTMM) workloads, a new form of deep learning inference
workloads, are emerging for applications areas like extended reality (XR) to support …

Application-driven exascale: The JUPITER benchmark suite

A Herten, S Achilles, D Alvarez… - … Conference for High …, 2024 - ieeexplore.ieee.org
Benchmarks are essential in the design of modern HPC installations, as they define key
aspects of system components. Beyond synthetic workloads, it is crucial to include real …

A gpu-specialized inference parameter server for large-scale deep recommendation models

Y Wei, M Langer, F Yu, M Lee, J Liu, J Shi… - Proceedings of the 16th …, 2022 - dl.acm.org
Recommendation systems are of crucial importance for a variety of modern apps and web
services, such as news feeds, social networks, e-commerce, search, etc. To achieve peak …

A novel protection design process to increase microgrid resilience

M Vygoder, F Banihashemi, J Gudex… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Successful discrimination of, isolation from, and recovery against short-circuit electrical faults
within microgrids having distributed energy resources (DERs) is challenging, as protection …

Workload interference prevention with intelligent routing and flexible job placement on dragonfly

Y Kang, X Wang, Z Lan - Proceedings of the 2023 ACM SIGSIM …, 2023 - dl.acm.org
Dragonfly is an indispensable interconnect topology for exascale HPC systems. To link tens
of thousands of compute nodes at a reasonable cost, Dragonfly shares network resources …

Fastml science benchmarks: Accelerating real-time scientific edge machine learning

J Duarte, N Tran, B Hawks, C Herwig, J Muhizi… - arxiv preprint arxiv …, 2022 - arxiv.org
Applications of machine learning (ML) are growing by the day for many unique and
challenging scientific applications. However, a crucial challenge facing these applications is …

MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from Microwatts to Megawatts for Sustainable AI

A Tschand, ATR Rajan, S Idgunji, A Ghosh… - arxiv preprint arxiv …, 2024 - arxiv.org
Rapid adoption of machine learning (ML) technologies has led to a surge in power
consumption across diverse systems, from tiny IoT devices to massive datacenter clusters …

ScaleFold: Reducing AlphaFold initial training time to 10 hours

F Zhu, A Nowaczynski, R Li, J **n, Y Song… - Proceedings of the 61st …, 2024 - dl.acm.org
AlphaFold2 has been hailed as a breakthrough in protein folding. It can rapidly predict
protein structures with lab-grade accuracy. However, its training procedure is prohibitively …

Modoru: Clos nanosecond optical switching for distributed deep training

C Wang, N Yoshikane, D Elson… - Journal of Optical …, 2023 - ieeexplore.ieee.org
Distributed deep training has become a significant consumer of bandwidth across
datacenter-scale networks. The diverse parallel strategies employed in deep training require …