Jet: Fast quantum circuit simulations with parallel task-based tensor-network contraction

T Vincent, LJ O'Riordan, M Andrenkov, J Brown… - Quantum, 2022 - quantum-journal.org
We introduce a new open-source software library $ Jet $, which uses task-based parallelism
to obtain speed-ups in classical tensor-network simulations of quantum circuits. These …

Betelgeuse: a review

JC Wheeler, E Chatzopoulos - Astronomy & Geophysics, 2023 - academic.oup.com
Was Betelgeuse once in a binary star system? What causes it to vary over a vast range of
timescales? Why did it dim dramatically in 2020? When and how will it explode? J. Craig …

Stellar mergers with hpx-kokkos and sycl: Methods of using an asynchronous many-task runtime system with sycl

G Daiß, P Diehl, H Kaiser, D Pflüger - Proceedings of the 2023 …, 2023 - dl.acm.org
Ranging from NVIDIA GPUs to AMD GPUs and Intel GPUs: Given the heterogeneity of
available accelerator cards within current supercomputers, portability is a key aspect for …

Betelgeuse as a Merger of a Massive Star with a Companion

S Shiber, E Chatzopoulos, B Munson… - The Astrophysical …, 2024 - iopscience.iop.org
We investigate the merger between a 16M⊙ star, on its way to becoming a red supergiant
(RSG), and a 4M⊙ main-sequence companion. Our study employs three-dimensional …

Simulating stellar merger using HPX/Kokkos on A64FX on Supercomputer Fugaku

P Diehl, G Daiß, K Huck, D Marcello, S Shiber… - The Journal of …, 2024 - Springer
The increasing availability of machines relying on non-GPU architectures, such as ARM
A64FX in high-performance computing, provides a set of interesting challenges to …

Beyond fork-join: Integration of performance portable Kokkos kernels with HPX

G Daiß, M Simberg, A Reverdell… - 2021 IEEE …, 2021 - ieeexplore.ieee.org
Between a widening range of GPU vendors and the trend of having more GPUs per compute
node in supercomputers such as Summit, Perlmutter, Frontier and Aurora, develo** …

[PDF][PDF] Methodological characterization and computational codes in the simulation of interacting galaxies

E Teófilo-Salvador, P Ambrocio-Cruz… - Artificial Intelligence …, 2024 - scholar.archive.org
Currently, technological development has exponentially fostered a growing collection of
dispersed and diversified information. In the case of galaxy interaction studies, it is important …

From task-based gpu work aggregation to stellar mergers: Turning fine-grained cpu tasks into portable gpu kernels

G Daiß, P Diehl, D Marcello… - 2022 IEEE/ACM …, 2022 - ieeexplore.ieee.org
Meeting both scalability and performance portability requirements is a challenge for any
HPC application, especially for adaptively refined ones. In Octo-Tiger, an astrophysics …

Design and Analysis of the Network Software Stack of an Asynchronous Many-task System--The LCI parcelport of HPX

J Yan, H Kaiser, M Snir - Proceedings of the SC'23 Workshops of the …, 2023 - dl.acm.org
The HPX asynchronous many-task runtime system has been using TCP and MPI as its
communication backends (parcelports). We developed a new HPX parcelport using a new …

From merging frameworks to merging stars: Experiences using hpx, kokkos and simd types

G Daiß, SY Singanaboina, P Diehl… - 2022 IEEE/ACM 7th …, 2022 - ieeexplore.ieee.org
Octo-Tiger, a large-scale 3D AMR code for the merger of stars, uses a combination of HPX,
Kokkos and explicit SIMD types, aiming to achieve performance-portability for a broad range …