Influence of data amount, data type and implementation packages in GPU coding

P Xu, MY Sun, YJ Gao, TJ Du, JM Hu, JJ Zhang - Array, 2022 - Elsevier
Abstract Graphic Processing Units (GPUs) are becoming popular in computational physics.
Seeing the increasing trend of using GPUs in the physics community, we provide a …

An efficient radial basis functions mesh deformation with greedy algorithm based on recurrence Choleskey decomposition and parallel computing

H Fang, Y Hu, C Yu, M Tie, J Liu, C Gong - Journal of Computational …, 2019 - Elsevier
The mesh deformation method based on radial basis functions (RBF) has many advantages
and is widely used. RBF based mesh deformation method mainly has two steps: data …

Assessment of a new hybrid-SSOR implicit temporal scheme for turbulent flows across a wide range of Mach numbers

J Liu, J Chen, Z Zhang, Y Yang, Z Xiao - Acta Mechanica Sinica, 2023 - Springer
The convergent efficiency and numerical stability of temporal discretization schemes for
Navier-Stokes (NS) equations are significant for engineering turbulent flow simulations …

Efficient mesh deformation based on Cartesian background mesh

H Fang, C Gong, C Yu, C Min, X Zhang, J Liu… - … & Mathematics with …, 2017 - Elsevier
Moving mesh is widely used in the simulation of aerodynamic shape optimization, multibody
relative motion, aircraft icing and aeroelasticity. The efficient and high quality mesh …

Petascale scramjet combustion simulation on the Tianhe-2 heterogeneous supercomputer

Y Che, M Yang, C Xu, Y Lu - Parallel Computing, 2018 - Elsevier
Combustion simulation is complex and computationally expensive as it involves integration
of fundamental chemical kinetics and multidimensional Computational Fluid Dynamics …

A hierarchical wavefront method for LU-SGS

K Komatsu, Y Hougi, M Sato, H Kobayashi - Computers & Fluids, 2022 - Elsevier
Abstract The lower–upper Symmetric-Gauss–Seidel (LU-SGS) method is one of typical
implicit methods, especially for an application that requires high convergence and accuracy …

An implicit gas-kinetic scheme for turbulent flow on unstructured hybrid mesh

D Pan, C Zhong, C Zhuo - Computers & Mathematics with Applications, 2018 - Elsevier
In this study, an implicit scheme for the gas-kinetic scheme (GKS) on the unstructured hybrid
mesh is proposed. The Spalart–Allmaras (SA) one equation turbulence model is …

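New high-performance computing systems and technologies

X Liao, N Xiao - Scientia Sinica Informationis, 2016 - scis.scichina.com
Abstract High-performance computing technology is a strategic high ground fiercely contested by countries around the world, especially developed countries, in the information age.
Addressing the challenges facing future high-performance computing technology, this paper, from the perspectives of microprocessors, high-performance computer systems …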

An efficient image to column algorithm for convolutional neural networks

C Gong, X Chen, S Lv, J Liu, B Yang… - … Joint Conference on …, 2021 - ieeexplore.ieee.org
Convolutional Neural Networks (CNNs) are a class of deep neural networks. The image to
column (im2col) procedure is an important step for CNN and consumes about 28.8% of the …

Implementation and Optimization of Double-Precision Floating-Point Exponential Functions on ARMv8 NEON Architecture

R Luo, X Cheng, D Hu, X Zhang… - … on Advanced Sensing …, 2023 - ieeexplore.ieee.org
The natural exponential function is an important basic arithmetic function with a wide
application in many fields such as artificial intelligence. In order to satisfy the demands of …