Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey on checkpointing strategies: Should we always checkpoint à la Young/Daly?
Abstract The Young/Daly formula provides an approximation of the optimal checkpointing
period for a parallel application executing on a supercomputing platform. It was originally …
period for a parallel application executing on a supercomputing platform. It was originally …
Addressing failures in exascale computing
We present here a report produced by a workshop on 'Addressing failures in exascale
computing'held in Park City, Utah, 4–11 August 2012. The charter of this workshop was to …
computing'held in Park City, Utah, 4–11 August 2012. The charter of this workshop was to …
[BUKU][B] Fault tolerance techniques for high-performance computing
This chapter provides an introduction to resilience methods. The emphasis is on
checkpointing, the de-facto standard technique for resilience in High Performance …
checkpointing, the de-facto standard technique for resilience in High Performance …
Lessons learned from the analysis of system failures at petascale: The case of blue waters
C Di Martino, Z Kalbarczyk, RK Iyer… - 2014 44th Annual …, 2014 - ieeexplore.ieee.org
This paper provides an analysis of failures and their impact for Blue Waters, the Cray hybrid
(CPU/GPU) supercomputer at the University of Illinois at Urbana-Champaign. The analysis …
(CPU/GPU) supercomputer at the University of Illinois at Urbana-Champaign. The analysis …
Dare: High-performance state machine replication on rdma networks
The increasing amount of data that needs to be collected and analyzed requires large-scale
datacenter architectures that are naturally more susceptible to faults of single components …
datacenter architectures that are naturally more susceptible to faults of single components …
Diagnosing performance variations in HPC applications using machine learning
With the growing complexity and scale of high performance computing (HPC) systems,
application performance variation has become a significant challenge in efficient and …
application performance variation has become a significant challenge in efficient and …
Failure prediction for HPC systems and applications: Current situation and open issues
As large-scale systems evolve towards post-petascale computing, it is crucial to focus on
providing fault-tolerance strategies that aim to minimize fault's effects on applications. By far …
providing fault-tolerance strategies that aim to minimize fault's effects on applications. By far …
A shoulder surfing resistant graphical authentication system
Authentication based on passwords is used largely in applications for computer security and
privacy. However, human actions such as choosing bad passwords and inputting passwords …
privacy. However, human actions such as choosing bad passwords and inputting passwords …
Fault prediction under the microscope: A closer look into HPC systems
A large percentage of computing capacity in today's large high-performance computing
systems is wasted because of failures. Consequently current research is focusing on …
systems is wasted because of failures. Consequently current research is focusing on …
Reading between the lines of failure logs: Understanding how HPC systems fail
N El-Sayed, B Schroeder - 2013 43rd annual IEEE/IFIP …, 2013 - ieeexplore.ieee.org
As the component count in supercomputing installations continues to increase, system
reliability is becoming one of the major issues in designing HPC systems. These issues will …
reliability is becoming one of the major issues in designing HPC systems. These issues will …