Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
The landscape of exascale research: A data-driven literature analysis
The next generation of supercomputers will break the exascale barrier. Soon we will have
systems capable of at least one quintillion (billion billion) floating-point operations per …
systems capable of at least one quintillion (billion billion) floating-point operations per …
Benchmarking machine learning methods for performance modeling of scientific applications
Performance modeling is an important and active area of research in high-performance
computing (HPC). It helps in better job scheduling and also improves overall performance of …
computing (HPC). It helps in better job scheduling and also improves overall performance of …
Kerncraft: A tool for analytic performance modeling of loop kernels
Achieving optimal program performance requires deep insight into the interaction between
hardware and software. For software developers without an in-depth background in …
hardware and software. For software developers without an in-depth background in …
Tida: High-level programming abstractions for data locality management
The high energy costs for data movement compared to computation gives paramount
importance to data locality management in programs. Managing data locality manually is not …
importance to data locality management in programs. Managing data locality manually is not …
Prediction modeling for application-specific communication architecture design of optical NoC
Multi-core systems-on-chip are becoming state-of-the-art. Therefore, there is a need for a
fast and energy-efficient interconnect to take full advantage of the computational capabilities …
fast and energy-efficient interconnect to take full advantage of the computational capabilities …
Automatic loop kernel analysis and performance modeling with kerncraft
Analytic performance models are essential for understanding the performance
characteristics of loop kernels, which consume a major part of CPU cycles in computational …
characteristics of loop kernels, which consume a major part of CPU cycles in computational …
Accelerating finite-rate chemical kinetics with coprocessors: Comparing vectorization methods on GPUs, MICs, and CPUs
CP Stone, AT Alferman, KE Niemeyer - Computer Physics Communications, 2018 - Elsevier
Accurate and efficient methods for solving stiff ordinary differential equations (ODEs) are a
critical component of turbulent combustion simulations with finite-rate chemistry. The ODEs …
critical component of turbulent combustion simulations with finite-rate chemistry. The ODEs …
Ppt-multicore: Performance prediction of openmp applications using reuse profiles and analytical modeling
We present PPT-Multicore, an analytical model embedded in the Performance Prediction
Toolkit (PPT) to predict parallel applications' performance running on a multicore processor …
Toolkit (PPT) to predict parallel applications' performance running on a multicore processor …
ComDetective: a lightweight communication detection tool for threads
Inter-thread communication is a vital performance indicator in shared-memory systems. Prior
works on identifying inter-thread communication employed hardware simulators or binary …
works on identifying inter-thread communication employed hardware simulators or binary …
Generating performance models for irregular applications
Many applications have irregular behavior-eg, input-dependent solvers, irregular memory
accesses, or unbiased branches-that cannot be captured using today's automated …
accesses, or unbiased branches-that cannot be captured using today's automated …