Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
The landscape of exascale research: A data-driven literature analysis
The next generation of supercomputers will break the exascale barrier. Soon we will have
systems capable of at least one quintillion (billion billion) floating-point operations per …
systems capable of at least one quintillion (billion billion) floating-point operations per …
A checkpoint of research on parallel i/o for high-performance computing
We present a comprehensive survey on parallel I/O in the high-performance computing
(HPC) context. This is an important field for HPC because of the historic gap between …
(HPC) context. This is an important field for HPC because of the historic gap between …
Performance optimality or reproducibility: that is the question
The era of extremely heterogeneous supercomputing brings with itself the devil of increased
performance variation and reduced reproducibility. There is a lack of understanding in the …
performance variation and reduced reproducibility. There is a lack of understanding in the …
A systematic survey on fault-tolerant solutions for distributed data analytics: Taxonomy, comparison, and future directions
S Isukapalli, SN Srirama - Computer Science Review, 2024 - Elsevier
Fault tolerance is becoming increasingly important for upcoming exascale systems,
supporting distributed data processing, due to the expected decrease in the Mean Time …
supporting distributed data processing, due to the expected decrease in the Mean Time …
EReinit: Scalable and efficient fault‐tolerance for bulk‐synchronous MPI applications
Scientists from many different fields have been develo** Bulk‐Synchronous MPI
applications to simulate and study a wide variety of scientific phenomena. Since failure rates …
applications to simulate and study a wide variety of scientific phenomena. Since failure rates …
Exploring energy saving opportunities in fault tolerant HPC systems
Nowadays, improving the energy efficiency of high-performance computing (HPC) systems
is one of the main drivers in scientific and technological research. As large-scale HPC …
is one of the main drivers in scientific and technological research. As large-scale HPC …
Prediction of energy consumption by checkpoint/restart in HPC
The fault tolerance method most used today in high-performance computing (HPC) is
coordinated checkpointing. This, like any other fault tolerance method, adds additional …
coordinated checkpointing. This, like any other fault tolerance method, adds additional …
Optimizing checkpoint intervals for reduced energy use in exascale systems
In today's high performance computing (HPC) systems, the probability of applications
experiencing failures has increased significantly with the increase in the number of system …
experiencing failures has increased significantly with the increase in the number of system …
Fault-tolerant regularity-based real-time virtual resources
Many safety-critical applications employ embedded real-time systems where both timing and
fault tolerance requirements must be continually satisfied. The Regularity-based Resource …
fault tolerance requirements must be continually satisfied. The Regularity-based Resource …
Exploiting Efficiency Opportunities Based on Workloads with Electron on Heterogeneous Clusters
Resource Management tools for large-scale clusters and data centers typically schedule
resources based on task requirements specified in terms of processor, memory, and disk …
resources based on task requirements specified in terms of processor, memory, and disk …