Productivity, performance, and portability for computational fluid dynamics applications

IZ Reguly, GR Mudalige - Computers & Fluids, 2020 - Elsevier
Hardware trends over the last decade show increasing complexity and heterogeneity in high
performance computing architectures, which presents developers of CFD applications with …

OpenSBLI v3.0: High-fidelity multi-block transonic aerofoil CFD simulations using domain specific languages on GPUs

DJ Lusher, A Sansica, ND Sandham, J Meng… - Computer Physics …, 2025 - Elsevier
OpenSBLI is an automatic code-generation framework for compressible Computational Fluid
Dynamics (CFD) simulations on heterogeneous computing architectures (previous release …

Breaking the computation and communication abstraction barrier in distributed machine learning workloads

A Jangda, J Huang, G Liu, AHN Sabet… - Proceedings of the 27th …, 2022 - dl.acm.org
Recent trends towards large machine learning models require both training and inference
tasks to be distributed. Considering the huge cost of training these models, it is imperative to …

Automatic code generation and optimization of large-scale stencil computation on many-core processors

M Li, Y Liu, H Yang, Y Hu, Q Sun, B Chen… - Proceedings of the 50th …, 2021 - dl.acm.org
Stencil computation is an indispensable building block of many scientific applications and is
widely used by the numerical solvers of partial differential equations (PDEs). Due to the …

Towards virtual certification of gas turbine engines with performance-portable simulations

GR Mudalige, IZ Reguly, A Prabhakar… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
We present the large-scale, computational fluid dynamics (CFD) simulation of a full
gas-turbine engine compressor, demonstrating capability towards overcoming current limitations …

Pencil: A pipelined algorithm for distributed stencils

H Wang… - … Conference for High …, 2020 - ieeexplore.ieee.org
Stencil computations are at the core of various Computational Fluid Dynamics (CFD)
applications and have been well-studied for several decades. Typically they're highly …

Large-scale performance of a DSL-based multi-block structured-mesh application for Direct Numerical Simulation

GR Mudalige, IZ Reguly, SP Jammy, CT Jacobs… - Journal of Parallel and …, 2019 - Elsevier
SBLI (Shock-wave/Boundary-layer Interaction) is a large-scale Computational Fluid
Dynamics (CFD) application, developed over 20 years at the University of Southampton and …

Auke: Automatic kernel code generation for an analogue SIMD focal-plane sensor-processor array

T Debrunner, S Saeedi, PHJ Kelly - ACM Transactions on Architecture …, 2019 - dl.acm.org
Focal-plane Sensor-Processor Arrays (FPSPs) are new imaging devices with parallel Single
Instruction Multiple Data (SIMD) computational capabilities built into every pixel. Compared …

Effect of Tripping and Domain Width on Transonic Buffet on Periodic NASA-CRM Airfoils

DJ Lusher, A Sansica, A Hashimoto - AIAA Journal, 2024 - arc.aiaa.org
Transonic buffet is an instability characterized by shock oscillations and separated boundary
layers. High-fidelity simulations have typically been limited to narrow domains to be …

Scheduling languages: A past, present, and future taxonomy

M Hall, C Oancea, AC Elster, A Rasch, S Joshi… - arXiv preprint arXiv …, 2024 - arxiv.org
Scheduling languages express to a compiler a sequence of optimizations to apply.
Compilers that support a scheduling language interface allow exploration of compiler …