Higher-level parallelization for local and distributed asynchronous task-based programming

H Kaiser, T Heller, D Bourgeois, D Fey - Proceedings of the First …, 2015 - dl.acm.org
One of the biggest challenges on the way to exascale computing is programmability in the
context of performance portability. The efficient utilization of the prospective architectures of …

An environmental modelling framework based on asynchronous many-tasks: Scalability and usability

K de Jong, D Panja, M van Kreveld… - … Modelling & Software, 2021 - Elsevier
Environmental modelling frameworks allow domain experts, rather than software
developers, to implement and run numerical simulation models in earth and environmental …

Enabling FPGA and AI Engine Tasks in the HPX Programming Framework for Heterogeneous High-Performance Computing

T Kalkhof, C Heinz, A Koch - International Symposium on Applied …, 2024 - Springer
The increasing complexity of modern exascale computers, with a growing number of cores
per node, poses a challenge to traditional programming models. To address this challenge …

Assessing the performance impact of using an active global address space in HPX: A case for AGAS

P Amini, H Kaiser - 2019 IEEE/ACM Third Annual Workshop on …, 2019 - ieeexplore.ieee.org
In this research, we describe the functionality of AGAS (Active Global Address Space), a
subsystem of the HPX runtime system that is designed to handle data locality at runtime …

Analysis and optimization of task granularity on the Java virtual machine

A Rosà, E Rosales, W Binder - ACM Transactions on Programming …, 2019 - dl.acm.org
Task granularity, i.e., the amount of work performed by parallel tasks, is a key performance
attribute of parallel applications. On the one hand, fine-grained tasks (i.e., small tasks carrying …

Methodology for adaptive active message coalescing in task based runtime systems

B Wagle, S Kellar, A Serio… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Overheads associated with fine grained communication in task based runtime systems are
one of the major bottlenecks that limit the performance of distributed applications. In this …

Shared Memory Parallelism in Modern C++ and HPX

P Diehl, SR Brandt, H Kaiser - SN Computer Science, 2024 - Springer
Parallel programming remains a daunting challenge, from struggling to express a parallel
algorithm without cluttering the underlying synchronous logic to describing which tools to …

Using intrinsic performance counters to assess efficiency in task-based parallel applications

P Grubel, H Kaiser, K Huck… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
The ability to measure performance characteristics of an application at runtime is essential
for monitoring the behavior of the application and the runtime system on the underlying …

Performance comparison of HPX versus traditional parallelization strategies for the discontinuous Galerkin method

M Bremer, K Kazhyken, H Kaiser, C Michoski… - Journal of Scientific …, 2019 - Springer
As high performance computing moves towards the exascale computing regime,
applications are required to expose increasingly fine grain parallelism to efficiently use next …

[BOOK] Dynamic adaptation in HPX: a task-based parallel runtime system

PA Grubel - 2016 - search.proquest.com
As parallel computation enters the exascale era where applications may run on millions to
billions of processors concurrently, all aspects of the computational model need to undergo …