Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Optimization techniques for GPU programming
In the past decade, Graphics Processing Units have played an important role in the field of
high-performance computing and they still advance new fields such as IoT, autonomous …
high-performance computing and they still advance new fields such as IoT, autonomous …
MIMD programs execution support on SIMD machines: a holistic survey
The Single Instruction Multiple Data (SIMD) architecture, supported by various high-
performance computing platforms, efficiently utilizes data-level parallelism. The SIMD model …
performance computing platforms, efficiently utilizes data-level parallelism. The SIMD model …
A benchmark set of highly-efficient CUDA and OpenCL kernels and its dynamic autotuning with Kernel Tuning Toolkit
In recent years, the heterogeneity of both commodity and supercomputers hardware has
increased sharply. Accelerators, such as GPUs or Intel Xeon Phi co-processors, are often …
increased sharply. Accelerators, such as GPUs or Intel Xeon Phi co-processors, are often …
cuFINUFFT: a load-balanced GPU library for general-purpose nonuniform FFTs
Nonuniform fast Fourier transforms dominate the computational cost in many applications
including image reconstruction and signal processing. We thus present a general-purpose …
including image reconstruction and signal processing. We thus present a general-purpose …
Advances in xmipp for cryo–electron microscopy: From xmipp to scipion
D Strelak, A Jiménez-Moreno, JL Vilas… - Molecules, 2021 - mdpi.com
Xmipp is an open-source software package consisting of multiple programs for processing
data originating from electron microscopy and electron tomography, designed and managed …
data originating from electron microscopy and electron tomography, designed and managed …
A survey of performance tuning techniques and tools for parallel applications
D Mustafa - IEEE Access, 2022 - ieeexplore.ieee.org
Automatic parallelization of sequential programs combined with auto-tuning is an alternative
to manual parallelization. With wider research directions and the increased number of …
to manual parallelization. With wider research directions and the increased number of …
Using hardware performance counters to speed up autotuning convergence on GPUs
Nowadays, GPU accelerators are commonly used to speed up general-purpose computing
tasks on a variety of hardware. However, due to the diversity of GPU architectures and …
tasks on a variety of hardware. However, due to the diversity of GPU architectures and …
Estimating resource budgets to ensure autotuning efficiency
Many state-of-the-art HPC applications rely on autotuning to maintain peak performance.
Autotuning allows a program to be re-optimized for new hardware, settings, or input–even …
Autotuning allows a program to be re-optimized for new hardware, settings, or input–even …
Umpalumpa: a framework for efficient execution of complex image processing workloads on heterogeneous nodes
D Střelák, D Myška, F Petrovič, J Polák, J Ol'ha… - Computing, 2023 - Springer
Modern computers are typically heterogeneous devices—besides the standard central
processing unit (CPU), they commonly include an accelerator such as a graphics processing …
processing unit (CPU), they commonly include an accelerator such as a graphics processing …
Leveraging the Hardware Resources to Accelerate cryo-EM Reconstruction of RELION on the New Sunway Supercomputer
The fast development of biomolecular structure determination has enabled the fine-grained
study of objects in the micro-world, such as proteins and RNAs. The world is benefited …
study of objects in the micro-world, such as proteins and RNAs. The world is benefited …