Lossy compression of communication traces using recurrent neural networks

J Sun, T Yan, H Sun, H Lin… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In high performance computing (HPC) systems, collecting and replaying communication
traces are fundamental approaches to analyze performance. With increasingly large-scale …

PerFlow: A domain specific framework for automatic performance analysis of parallel applications

Y **, H Wang, R Zhong, C Zhang, J Zhai - Proceedings of the 27th ACM …, 2022 - dl.acm.org
Performance analysis is widely used to identify performance issues of parallel applications.
However, complex communications and data dependence, as well as the interactions …

Graph-Centric Performance Analysis for Large-Scale Parallel Applications

Y **, H Wang, R Zhong, C Zhang… - … on Parallel and …, 2024 - ieeexplore.ieee.org
Performance analysis is essential for understanding the performance behaviors of parallel
programs and detecting performance bottlenecks. Whereas, complex interconnections …

Persisting and Reusing Results of Static Program Analyses on a Large Scale

J Düsing, B Hermann - 2023 38th IEEE/ACM International …, 2023 - ieeexplore.ieee.org
Static Program Analysis (SPA) has long been established as an important technique for
gaining insights into software systems. Over the last years, analysis designers increasingly …

Siesta: Synthesizing Proxy Applications for MPI Programs

J Luo, T Yan, Q Xu, J Sun, G Sun - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Proxy applications (proxy-apps) are basic tools for evaluating the performance of specific
workloads on high-performance computing (HPC) systems. Since the development of high …

[BOOK][B] Performance Analysis of Parallel Applications for HPC

J Zhai, Y **, W Chen, W Zheng - 2023 - Springer
Modern supercomputers have brought about an unprecedented growth in computing power.
However, many parallel applications fail to efficiently utilize this power due to performance …

Synthesizing Proxy Applications for MPI Programs

T Yan, Q Xu, J Luo, J Sun, G Sun - arxiv preprint arxiv:2301.06062, 2023 - arxiv.org
Proxy applications (proxy-apps) are basic tools for evaluating the performance of specific
workloads on high-performance computing (HPC) systems. Since the development of high …

Graph Analysis for Scalability Analysis

J Zhai, Y **, W Chen, W Zheng - … Analysis of Parallel Applications for HPC, 2023 - Springer
Scaling a parallel program to modern supercomputers is challenging due to inter-process
communication, Amdahl's law, and resource contention. Performance analysis tools for …

Domain-Specific Framework for Performance Analysis

J Zhai, Y **, W Chen, W Zheng - … Analysis of Parallel Applications for HPC, 2023 - Springer
In this book, we propose several performance analysis approaches for communication
analysis, memory monitoring, etc. However, to implement each such analysis, significant …

[BOOK][B] Towards Scale-Checkable Systems

CAS Moraga - 2022 - search.proquest.com
In this document, we present our approaches for understanding and discovering scalability
faults, ie faults whose symptoms appear at larger scales but are not visible at smaller scales …