Scalable algorithms for molecular dynamics simulations on commodity clusters
Although molecular dynamics (MD) simulations of biomolecular systems often run for days to
months, many events of great scientific interest and pharmaceutical relevance occur on long …
Addressing failures in exascale computing
We present here a report produced by a workshop on 'Addressing failures in exascale
computing' held in Park City, Utah, 4–11 August 2012. The charter of this workshop was to …
Architecture exploration for ambient energy harvesting nonvolatile processors
Energy harvesting has been widely investigated as a promising method of providing power
for ultra-low-power applications. Such energy sources include solar energy, radio-frequency …
Deterministic replay: A survey
Deterministic replay is a type of emerging technique dedicated to providing deterministic
executions of computer programs in the presence of nondeterministic factors. The …
ThyNVM: Enabling software-transparent crash consistency in persistent memory systems
Emerging byte-addressable nonvolatile memories (NVMs) promise persistent memory,
which allows processors to directly access persistent data in main memory. Yet, persistent …
Detailed design and evaluation of redundant multithreading alternatives
Exponential growth in the number of on-chip transistors, coupled with reductions in voltage
levels, makes each generation of microprocessors increasingly vulnerable to transient faults …
[BOOK][B] Architecture design for soft errors
S Mukherjee - 2011 - books.google.com
Architecture Design for Soft Errors provides a comprehensive description of the architectural
techniques to tackle the soft error problem. It covers the new methodologies for quantitative …
DMTCP: Transparent checkpointing for cluster computations and the desktop
DMTCP (distributed multithreaded checkpointing) is a transparent user-level checkpointing
package for distributed applications. Checkpointing and restart is demonstrated for a wide …
A "flight data recorder" for enabling full-system multiprocessor deterministic replay
Debuggers have been proven indispensable in improving software reliability. Unfortunately,
on most real-life software, debuggers fail to deliver their most essential feature: a faithful …
Bugnet: Continuously recording program execution for deterministic replay debugging
Significant time is spent by companies trying to reproduce and fix the bugs that occur for
released code. To assist developers, we propose the BugNet architecture to continuously …