The TAU parallel performance system
The ability of performance technology to keep pace with the growing complexity of parallel
and distributed systems depends on robust performance frameworks that can at once …
FLEX-MPI: an MPI extension for supporting dynamic load balancing on heterogeneous non-dedicated systems
This paper introduces FLEX-MPI, a novel runtime approach for the dynamic load balancing
of MPI-based SPMD applications running on heterogeneous platforms in the presence of …
Adaptive main memory compression
IC Tuduce, TR Gross - 2005 - usenix.org
Applications that use large data sets frequently exhibit poor performance because the size of
their working set exceeds the real memory, causing excess page faults, and ultimately …
A fast parallel algorithm for selected inversion of structured sparse matrices with application to 2D electronic structure calculations
An efficient parallel algorithm is presented for computing selected components of A^-1
where A is a structured symmetric sparse matrix. Calculations of this type are useful for …
Combining instrumentation and sampling for trace-based application performance analysis
Performance analysis is vital for optimizing the execution of high performance computing
applications. Today different techniques for gathering, processing, and analyzing application …
PAPI deployment, evaluation, and extensions
PAPI is a cross-platform interface to the hardware performance counters available on most
modern microprocessors. These counters exist as a small set of registers that count events …
Methodology for modelling SPMD hybrid parallel computation.
LM Liebrock, SP Goudy - Concurrency & Computation …, 2008 - search.ebscohost.com
This research defines and analyzes a methodology for deriving a performance model for
SPMD hybrid parallel applications. Hybrid parallelism combines shared-memory and …
Caracterización de aplicaciones con comportamiento irregular para predecir su rendimiento, basado en la filosofía PAS2P
FL Tirado Marabolí - 2024 - ddd.uab.cat
Modeling parallel scientific applications allows us to understand the details of
the behavior of parallel applications. Many scientific applications have a …
High-performance three-dimensional image reconstruction for molecular structure determination
We describe an efficient parallel implementation of a reliable iterative reconstruction
algorithm for estimating the three-dimensional (3D) density map of a macromolecular …
Inherent replica inconsistency in Cassandra
Inherent replica inconsistency refers to the difference among the replicas of the same logical
data item in the write propagation process of a normally running distributed storage system …