Optimization techniques for GPU programming
In the past decade, Graphics Processing Units have played an important role in the field of
high-performance computing and they still advance new fields such as IoT, autonomous …
high-performance computing and they still advance new fields such as IoT, autonomous …
Massively parallel lattice–Boltzmann codes on large GPU clusters
This paper describes a massively parallel code for a state-of-the art thermal lattice–
Boltzmann method. Our code has been carefully optimized for performance on one GPU and …
Boltzmann method. Our code has been carefully optimized for performance on one GPU and …
A new Neumann boundary condition scheme for the thermal lattice Boltzmann method
In this paper we propose a new scheme for implementing the Neumann boundary condition
(BC) with the thermal Lattice Boltzmann Method (LBM). It consists in transforming the wall …
(BC) with the thermal Lattice Boltzmann Method (LBM). It consists in transforming the wall …
Evaluation of DVFS techniques on modern HPC processors and accelerators for energy‐aware applications
Energy efficiency is becoming increasingly important for computing systems, in particular for
large scale High Performance Computing (HPC) facilities. In this work, we evaluate, from a …
large scale High Performance Computing (HPC) facilities. In this work, we evaluate, from a …
Characterization of petrophysical properties using pore-network and lattice-Boltzmann modelling: Choice of method and image sub-volume size
N Alyafei, TJ Mckay, TI Solling - Journal of Petroleum Science and …, 2016 - Elsevier
The invention and progression of micro-CT scanning technology has significantly improved
the quality and resolution of tomographic images. It is now possible to fully resolve simpler …
the quality and resolution of tomographic images. It is now possible to fully resolve simpler …
Performance and portability of accelerated lattice Boltzmann applications with OpenACC
An increasingly large number of HPC systems rely on heterogeneous architectures
combining traditional multi‐core CPUs with power efficient accelerators. Designing efficient …
combining traditional multi‐core CPUs with power efficient accelerators. Designing efficient …
Performance and power analysis of hpc workloads on heterogeneous multi-node clusters
Performance analysis tools allow application developers to identify and characterize the
inefficiencies that cause performance degradation in their codes, allowing for application …
inefficiencies that cause performance degradation in their codes, allowing for application …
Optimization of lattice Boltzmann simulations on heterogeneous computers
High-performance computing systems are more and more often based on accelerators.
Computing applications targeting those systems often follow a host-driven approach, in …
Computing applications targeting those systems often follow a host-driven approach, in …
Early experience on porting and running a Lattice Boltzmann code on the Xeon-Phi co-processor
In this paper we report on our early experience on porting, optimizing and benchmarking a
Lattice Boltzmann (LB) code on the Xeon-Phi co-processor, the first generally available …
Lattice Boltzmann (LB) code on the Xeon-Phi co-processor, the first generally available …
Physically based visual simulation of the Lattice Boltzmann method on the GPU: a survey
O Navarro-Hinojosa, S Ruiz-Loza… - The Journal of …, 2018 - Springer
The rapid increase in performance, programmability, and availability of graphics processing
units (GPUs) has made them a compelling platform for computationally demanding tasks in …
units (GPUs) has made them a compelling platform for computationally demanding tasks in …