Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey of numerical linear algebra methods utilizing mixed-precision arithmetic
The efficient utilization of mixed-precision numerical linear algebra algorithms can offer
attractive acceleration to scientific computing applications. Especially with the hardware …
attractive acceleration to scientific computing applications. Especially with the hardware …
Efficient parallel implementations of sparse triangular solves for GPU architectures
The sparse triangular matrix solve (SpTrSV) is an important computation kernel that is
demanded by a variety of numerical methods such as the Gauss-Seidel iterations. However …
demanded by a variety of numerical methods such as the Gauss-Seidel iterations. However …
A scalable geometric multigrid solver for nonsymmetric elliptic systems with application to variable-density flows
A geometric multigrid algorithm is introduced for solving nonsymmetric linear systems
resulting from the discretization of the variable density Navier–Stokes equations on …
resulting from the discretization of the variable density Navier–Stokes equations on …
A new class of amg interpolation methods based on matrix-matrix multiplications
A new class of distance-two interpolation methods for algebraic multigrid (AMG) that can be
formulated in terms of sparse matrix-matrix multiplications is presented and analyzed …
formulated in terms of sparse matrix-matrix multiplications is presented and analyzed …
Stability analysis and performance evaluation of additive mixed-precision Runge-Kutta methods
Abstract Additive Runge-Kutta methods designed for preserving highly accurate solutions in
mixed-precision computation were previously proposed and analyzed. These specially …
mixed-precision computation were previously proposed and analyzed. These specially …
A two-level GPU-accelerated incomplete LU preconditioner for general sparse linear systems
This paper presents a parallel preconditioning approach based on incomplete LU (ILU)
factorizations in the framework of Domain Decomposition (DD) for general sparse linear …
factorizations in the framework of Domain Decomposition (DD) for general sparse linear …
FP16 Acceleration in Structured Multigrid Preconditioner for Real-World Applications
Half-precision hardware support is now almost ubiquitous. In contrast to its active use in AI,
half-precision is less commonly employed in scientific and engineering computing. The …
half-precision is less commonly employed in scientific and engineering computing. The …
Pipelined iterative solvers with kernel fusion for graphics processing units
We revisit the implementation of iterative solvers on discrete graphics processing units and
demonstrate the benefit of implementations using extensive kernel fusion for pipelined …
demonstrate the benefit of implementations using extensive kernel fusion for pipelined …
Accelerating geometric multigrid preconditioning with half-precision arithmetic on GPUs
KL Oo, A Vogel - arxiv preprint arxiv:2007.07539, 2020 - arxiv.org
With the hardware support for half-precision arithmetic on NVIDIA V100 GPUs, high-
performance computing applications can benefit from lower precision at appropriate spots to …
performance computing applications can benefit from lower precision at appropriate spots to …
Tusas: A fully implicit parallel approach for coupled phase-field equations
We develop a fully-coupled, fully-implicit approach for phase-field modeling of solidification
in metals and alloys. Predictive simulation of solidification in pure metals and metal alloys …
in metals and alloys. Predictive simulation of solidification in pure metals and metal alloys …