A stable multi-scale kernel for topological machine learning
Topological data analysis offers a rich source of valuable information to study vision
problems. Yet, so far we lack a theoretically sound connection to popular kernel-based …
problems. Yet, so far we lack a theoretically sound connection to popular kernel-based …
Likwid: A lightweight performance-oriented tool suite for x86 multicore environments
Exploiting the performance of today's processors requires intimate knowledge of the
microarchitecture as well as an awareness of the ever-growing complexity in thread and …
microarchitecture as well as an awareness of the ever-growing complexity in thread and …
Score-P: A unified performance measurement system for petascale applications
The rapidly growing number of cores on modern supercomputers imposes scalability
demands not only on applications but also on the software tools needed for their …
demands not only on applications but also on the software tools needed for their …
Automatic performance analysis with periscope
M Gerndt, M Ott - Concurrency and Computation: Practice and …, 2010 - Wiley Online Library
Performance analysis is essential to fully exploit the potential of high‐performance
computers. With the imminence of petascale systems which will consist of ten thousands or …
computers. With the imminence of petascale systems which will consist of ten thousands or …
LIKWID: lightweight performance tools
Exploiting the performance of today's microprocessors requires intimate knowledge of the
microarchitecture as well as an awareness of the ever-growing complexity in thread and …
microarchitecture as well as an awareness of the ever-growing complexity in thread and …
SOMA: Observability, monitoring, and in situ analytics for exascale applications
With the rise of exascale systems and large, data‐centric workflows, the need to observe
and analyze high performance computing (HPC) applications during their execution is …
and analyze high performance computing (HPC) applications during their execution is …
On-line detection of large-scale parallel application's structure
With larger and larger systems being constantly deployed, trace-based performance
analysis of parallel applications has become a challenging task. Even if the amount of …
analysis of parallel applications has become a challenging task. Even if the amount of …
Diogenes: Looking for an honest cpu/gpu performance measurement tool
GPU accelerators have become common on today's leadership-class computing platforms.
Exploiting the additional parallelism offered by GPUs is fraught with challenges. A key …
Exploiting the additional parallelism offered by GPUs is fraught with challenges. A key …
A novel context-based risk assessment approach in vehicular networks
Vehicular Networks (VANET) are the largest real life application of ad-hoc networks where
nodes are represented via fast moving vehicles. As VANET is characterised with several …
nodes are represented via fast moving vehicles. As VANET is characterised with several …
Distributed wait state tracking for runtime MPI deadlock detection
The widely used Message Passing Interface (MPI) with its multitude of communication
functions is prone to usage errors. Runtime error detection tools aid in the removal of these …
functions is prone to usage errors. Runtime error detection tools aid in the removal of these …