VFC: The Vienna Fortran Compiler

S Benkner - Scientific Programming, 1999 - content.iospress.com
High Performance Fortran (HPF) offers an attractive high‐level language interface
for programming scalable parallel architectures providing the user with directives for the …

Performance analysis of distributed applications using automatic classification of communication inefficiencies

J Vetter - Proceedings of the 14th international conference on …, 2000 - dl.acm.org
We present a technique for performance analysis that helps users understand the
communication behavior of their message passing applications. Our method automatically …

Method and system for automatically testing performance of applications run in a distributed processing structure and corresponding computer program product

A Nasuto, D Gotta - US Patent 8,726,243, 2014 - Google Patents

Virtue: Performance visualization of parallel and distributed applications

E Shaffer, DA Reed, S Whitmore, B Schaeffer - Computer, 1999 - ieeexplore.ieee.org
High-speed, wide-area networks have made it both possible and desirable to interconnect
geographically distributed applications that control distributed collections of scientific data …

Configuration independent analysis for characterizing shared-memory applications

GA Abandah, ES Davidson - Proceedings of the First Merged …, 1998 - ieeexplore.ieee.org
The paper demonstrates that configuration independent analysis of shared memory
applications is a useful tool to characterize inherent application characteristics that do not …

CATCH—a call-graph based automatic tool for capture of hardware performance metrics for MPI and OpenMP applications

L DeRose, F Wolf - Euro-Par 2002 Parallel Processing: 8th International …, 2002 - Springer
Catch is a profiler for parallel applications that collects hardware performance counter
information for each function called in the program, based on the path that led to the function …

[BOOK][B] Input/output intensive massively parallel computing: language support, automatic parallelization, advanced optimization, and runtime systems

P Brezany - 1997 - books.google.com
Massively parallel processing is currently the most promising answer to the quest for
increased computer performance. This has resulted in the development of new …

Redistribution strategies for portable parallel FFT: a case study

A Dubey, D Tessera - Concurrency and Computation: Practice …, 2001 - Wiley Online Library
The best approach to parallelize multidimensional FFT algorithms has long been under
debate. Distributed transposes are widely used, but they also vary in communication policies …

[PDF][PDF] A visualization tool for analyzing cluster performance data

R Haynes, P Crossno, E Russell - Third IEEE International …, 2001 - scholar.archive.org
This paper describes a unique visualization tool that has been used to analyze performance
of the Cplant™ clusters [13] at Sandia National Laboratories. As commodity cluster systems …

Parallel architectures: Performance prediction: A case study using a scalable shared-virtual-memory machine

XH Sun, J Zhu - … Parallel & Distributed Technology: Systems & …, 1996 - ieeexplore.ieee.org
Parallel Architectures. Performance Prediction: A Case Study Using a Scalable
Shared-Virtual-Memory Machine. Xian-He Sun, Louisiana State University …