A stable multi-scale kernel for topological machine learning

J Reininghaus, S Huber, U Bauer… - Proceedings of the …, 2015 - openaccess.thecvf.com
Topological data analysis offers a rich source of valuable information to study vision
problems. Yet, so far we lack a theoretically sound connection to popular kernel-based …

LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments

J Treibig, G Hager, G Wellein - 2010 39th international …, 2010 - ieeexplore.ieee.org
Exploiting the performance of today's processors requires intimate knowledge of the
microarchitecture as well as an awareness of the ever-growing complexity in thread and …

Score-P: A unified performance measurement system for petascale applications

DA Mey, S Biersdorf, C Bischof, K Diethelm… - Competence in High …, 2012 - Springer
The rapidly growing number of cores on modern supercomputers imposes scalability
demands not only on applications but also on the software tools needed for their …

Automatic performance analysis with Periscope

M Gerndt, M Ott - Concurrency and Computation: Practice and …, 2010 - Wiley Online Library
Performance analysis is essential to fully exploit the potential of high‐performance
computers. With the imminence of petascale systems which will consist of tens of thousands or …

LIKWID: lightweight performance tools

J Treibig, G Hager, G Wellein - Competence in High Performance …, 2011 - Springer
Exploiting the performance of today's microprocessors requires intimate knowledge of the
microarchitecture as well as an awareness of the ever-growing complexity in thread and …

SOMA: Observability, monitoring, and in situ analytics for exascale applications

D Yokelson, O Lappi, S Ramesh… - Concurrency and …, 2024 - Wiley Online Library
With the rise of exascale systems and large, data‐centric workflows, the need to observe
and analyze high performance computing (HPC) applications during their execution is …

On-line detection of large-scale parallel application's structure

G Llort, J Gonzalez, H Servat… - … on Parallel & …, 2010 - ieeexplore.ieee.org
With larger and larger systems being constantly deployed, trace-based performance
analysis of parallel applications has become a challenging task. Even if the amount of …

Diogenes: Looking for an honest CPU/GPU performance measurement tool

B Welton, BP Miller - Proceedings of the International Conference for …, 2019 - dl.acm.org
GPU accelerators have become common on today's leadership-class computing platforms.
Exploiting the additional parallelism offered by GPUs is fraught with challenges. A key …

A novel context-based risk assessment approach in vehicular networks

F Ahmad, A Adnane - 2016 30th International Conference on …, 2016 - ieeexplore.ieee.org
Vehicular Networks (VANET) are the largest real life application of ad-hoc networks where
nodes are represented via fast moving vehicles. As VANET is characterised with several …

Distributed wait state tracking for runtime MPI deadlock detection

T Hilbrich, BR de Supinski, WE Nagel, J Protze… - Proceedings of the …, 2013 - dl.acm.org
The widely used Message Passing Interface (MPI) with its multitude of communication
functions is prone to usage errors. Runtime error detection tools aid in the removal of these …