The TAU parallel performance system

SS Shende, AD Malony - The International Journal of High …, 2006 - journals.sagepub.com
The ability of performance technology to keep pace with the growing complexity of parallel
and distributed systems depends on robust performance frameworks that can at once …

FLEX-MPI: an MPI extension for supporting dynamic load balancing on heterogeneous non-dedicated systems

G Martin, MC Marinescu, DE Singh… - Euro-Par 2013 Parallel …, 2013 - Springer
This paper introduces FLEX-MPI, a novel runtime approach for the dynamic load balancing
of MPI-based SPMD applications running on heterogeneous platforms in the presence of …

Adaptive main memory compression

IC Tuduce, TR Gross - 2005 - usenix.org
Applications that use large data sets frequently exhibit poor performance because the size of
their working set exceeds the real memory, causing excess page faults, and ultimately …

A fast parallel algorithm for selected inversion of structured sparse matrices with application to 2D electronic structure calculations

L Lin, C Yang, J Lu, L Ying, WE - SIAM Journal on Scientific Computing, 2011 - SIAM
An efficient parallel algorithm is presented for computing selected components of A^-1
where A is a structured symmetric sparse matrix. Calculations of this type are useful for …

Combining instrumentation and sampling for trace-based application performance analysis

T Ilsche, J Schuchart, R Schöne… - Tools for High …, 2015 - Springer
Performance analysis is vital for optimizing the execution of high performance computing
applications. Today different techniques for gathering, processing, and analyzing application …

PAPI deployment, evaluation, and extensions

S Moore, D Terpstra, K London, P Mucci… - 2003 User Group …, 2003 - ieeexplore.ieee.org
PAPI is a cross-platform interface to the hardware performance counters available on most
modern microprocessors. These counters exist as a small set of registers that count events …

Methodology for modelling SPMD hybrid parallel computation.

LM Liebrock, SP Goudy - Concurrency & Computation …, 2008 - search.ebscohost.com
This research defines and analyzes a methodology for deriving a performance model for
SPMD hybrid parallel applications. Hybrid parallelism combines shared-memory and …

Characterization of applications with irregular behavior to predict their performance, based on the PAS2P philosophy

FL Tirado Marabolí - 2024 - ddd.uab.cat
Modeling parallel scientific applications lets us understand the details of the
behavior of parallel applications. Many scientific applications have a …

High-performance three-dimensional image reconstruction for molecular structure determination

J Chung, P Sternberg, C Yang - The International Journal of …, 2010 - journals.sagepub.com
We describe an efficient parallel implementation of a reliable iterative reconstruction
algorithm for estimating the three-dimensional (3D) density map of a macromolecular …

Inherent replica inconsistency in Cassandra

X Huang, J Wang, J Bai, G Ding… - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
Inherent replica inconsistency refers to the difference among the replicas of the same logical
data item in the write propagation process of a normally running distributed storage system …