Shared memory consistency models: A tutorial

SV Adve, K Gharachorloo - computer, 1996 - ieeexplore.ieee.org
The memory consistency model of a system affects performance, programmability, and
portability. We aim to describe memory consistency models in a way that most computer …

Invisispec: Making speculative execution invisible in the cache hierarchy

M Yan, J Choi, D Skarlatos, A Morrison… - 2018 51st Annual …, 2018 - ieeexplore.ieee.org
Hardware speculation offers a major surface for micro-architectural covert and side channel
attacks. Unfortunately, defending against speculative execution attacks is challenging. The …

Speculative taint tracking (stt) a comprehensive protection for speculatively accessed data

J Yu, M Yan, A Khyzha, A Morrison, J Torrellas… - Proceedings of the …, 2019 - dl.acm.org
Speculative execution attacks present an enormous security threat, capable of reading
arbitrary program data under malicious speculation, and later exfiltrating that data over …

Memory persistency

S Pelley, PM Chen, TF Wenisch - ACM SIGARCH Computer Architecture …, 2014 - dl.acm.org
Emerging nonvolatile memory technologies (NVRAM) promise the performance of DRAM
with the persistence of disk. However, constraining NVRAM write order, necessary to ensure …

[BUCH][B] Parallel computer architecture: a hardware/software approach

D Culler, JP Singh, A Gupta - 1999 - books.google.com
The most exciting development in parallel computer architecture is the convergence of
traditionally disparate approaches on a common machine structure. This book explains the …

The Java memory model

J Manson, W Pugh, SV Adve - ACM SIGPLAN Notices, 2005 - dl.acm.org
This paper describes the new Java memory model, which has been revised as part of Java
5.0. The model specifies the legal behaviors for a multithreaded program; it defines the …

Foundations of the C++ concurrency memory model

HJ Boehm, SV Adve - ACM SIGPLAN Notices, 2008 - dl.acm.org
Currently multi-threaded C or C++ programs combine a single-threaded programming
language with a separate threads library. This is not entirely sound [7]. We describe an effort …

Reactive NUCA: near-optimal block placement and replication in distributed caches

N Hardavellas, M Ferdman, B Falsafi… - Proceedings of the 36th …, 2009 - dl.acm.org
Increases in on-chip communication delay and the large working sets of server and scientific
workloads complicate the design of the on-chip last-level cache for multicore processors …

Speculative lock elision: Enabling highly concurrent multithreaded execution

R Rajwar, JR Goodman - Proceedings. 34th ACM/IEEE …, 2001 - ieeexplore.ieee.org
Serialization of threads due to critical sections is a fundamental bottleneck to achieving high
performance in multithreaded programs. Dynamically, such serialization may be …

Delegated persist ordering

A Kolli, J Rosen, S Diestelhorst, A Saidi… - 2016 49th Annual …, 2016 - ieeexplore.ieee.org
Systems featuring a load-store interface to persistent memory (PM) are expected soon,
making in-memory persistent data structures feasible. Ensuring persistent data structure …