Transient-Execution Attacks: A Computer Architect Perspective
Computer architects employ a series of performance optimizations at the micro-architecture
level. These optimizations are meant to be invisible to the programmer but they are implicitly …
level. These optimizations are meant to be invisible to the programmer but they are implicitly …
Compiler-directed whole-system persistence
Nonvolatile memory (NVM) technologies have gained increasing attention thanks to their
density and durability benefits. However, leveraging NVM can cause a crash consistency …
density and durability benefits. However, leveraging NVM can cause a crash consistency …
Nosq: Store-load communication without a store queue
This paper presents NoSQ (short for No Store Queue), a microarchitecture that performs
store-load communication without a store queue and without executing stores in the outof …
store-load communication without a store queue and without executing stores in the outof …
Fire-and-forget: Load/store scheduling with no store queue at all
S Subramaniam, GH Loh - 2006 39th Annual IEEE/ACM …, 2006 - ieeexplore.ieee.org
Modern processors use CAM-based load and store queues (LQ/SQ) to support out-of-order
memory scheduling and store-to-load forwarding. However, the LQ and SQ scale poorly for …
memory scheduling and store-to-load forwarding. However, the LQ and SQ scale poorly for …
Zero-value caches: Cancelling loads that return zero
The speed gap between processor and memory continues to limit performance. To address
this problem, we explore the potential of eliminating zero loads-loads accessing memory …
this problem, we explore the potential of eliminating zero loads-loads accessing memory …
Late-binding: Enabling unordered load-store queues
Conventional load/store queues (LSQs) are an impediment to both power-efficient execution
in superscalar processors and scaling tolarge-window designs. In this paper, we propose …
in superscalar processors and scaling tolarge-window designs. In this paper, we propose …
A tale of two cities: case studies of GSS transition in two organizations
A Agres, GJ de Vreede… - 37th Annual Hawaii …, 2004 - ieeexplore.ieee.org
Research shows that, under certain circumstances, people using GSS can be substantially
more productive than people who do not. However GSS has been slow to transition into the …
more productive than people who do not. However GSS has been slow to transition into the …
Disambiguation-free out of order load store queue
MA Abdallah - US Patent 10,048,964, 2018 - Google Patents
In a processor, a disambiguation-free out of order load store queue method. The method
includes implementing a memory resource that can be accessed by a plurality of …
includes implementing a memory resource that can be accessed by a plurality of …
Static prediction of silent stores
A store operation is called “silent” if it writes in memory a value that is already there. The
ability to detect silent stores is important, because they might indicate performance bugs …
ability to detect silent stores is important, because they might indicate performance bugs …
TikTag: Breaking ARM's Memory Tagging Extension with Speculative Execution
ARM Memory Tagging Extension (MTE) is a new hardware feature introduced in ARMv8. 5-
A architecture, aiming to detect memory corruption vulnerabilities. The low overhead of MTE …
A architecture, aiming to detect memory corruption vulnerabilities. The low overhead of MTE …