Evaluation of compilers' capability of automatic vectorization based on source code analysis
JG Feng, YP He, QM Tao - Scientific Programming, 2021 - Wiley Online Library
Automatic vectorization is an important technique for compilers to improve the parallelism of
programs. With the widespread usage of SIMD (Single Instruction Multiple Data) extensions …
AnyDSL: a partial evaluation framework for programming high-performance libraries
This paper advocates programming high-performance code using partial evaluation. We
present a clean-slate programming system with a simple, annotation-based, online partial …
Side-channel elimination via partial control-flow linearization
Partial control-flow linearization is a code transformation conceived to maximize work
performed in vectorized programs. In this article, we find a new service for it. We show that …
All you need is superword-level parallelism: systematic control-flow vectorization with SLP
Superword-level parallelism (SLP) vectorization is a proven technique for vectorizing
straight-line code. It works by replacing independent, isomorphic instructions with equivalent …
Vectorization-aware loop unrolling with seed forwarding
Loop unrolling is a widely adopted loop transformation, commonly used for enabling
subsequent optimizations. Straight-line-code vectorization (SLP) is an optimization that …
DARM: control-flow melding for SIMT thread divergence reduction
GPGPUs use the Single-Instruction-Multiple-Thread (SIMT) execution model where a group
of threads—wavefront or warp—execute instructions in lockstep. When threads in a group …
Specfuscator: Evaluating branch removal as a Spectre mitigation
Attacks exploiting speculative execution, known as Spectre attacks, have gained substantial
attention in the scientific community and in industry with a broad range of defense …
Super-Node SLP: Optimized vectorization for code sequences containing operators and their inverse elements
SLP Auto-vectorization converts straight-line code into vector code. It scans input code for
groups of instructions that can be combined into vectors and replaces them with their …
Memory-safe elimination of side channels
A program is said to be isochronous if its running time does not depend on classified
information. The programming languages literature contains much work that transforms …
Pointer-based divergence analysis for OpenCL 2.0 programs
SC Wang, LY Yu, LA Her, YS Hwang… - ACM Transactions on …, 2021 - dl.acm.org
A modern GPU is designed with many large thread groups to achieve a high throughput and
performance. Within these groups, the threads are grouped into fixed-size SIMD batches in …