Transactional memory coherence and consistency
L Hammond, V Wong, M Chen, BD Carlstrom… - ACM SIGARCH …, 2004 - dl.acm.org
In this paper, we propos a new shared memory model: Transactionalmemory Coherence
and Consistency (TCC). TCC providesa model in which atomic transactions are always the …
and Consistency (TCC). TCC providesa model in which atomic transactions are always the …
The gem5 simulator: Version 20.0+
The open-source and community-supported gem5 simulator is one of the most popular tools
for computer architecture research. This simulation infrastructure allows researchers to …
for computer architecture research. This simulation infrastructure allows researchers to …
Reactive NUCA: near-optimal block placement and replication in distributed caches
Increases in on-chip communication delay and the large working sets of server and scientific
workloads complicate the design of the on-chip last-level cache for multicore processors …
workloads complicate the design of the on-chip last-level cache for multicore processors …
Cooperative caching for chip multiprocessors
J Chang, GS Sohi - ACM SIGARCH Computer Architecture News, 2006 - dl.acm.org
This paper presents CMP Cooperative Caching, a unified framework to manage a CMP's
aggregate on-chip cache resources. Cooperative caching combines the strengths of private …
aggregate on-chip cache resources. Cooperative caching combines the strengths of private …
[BOOK][B] A primer on memory consistency and cache coherence
Many modern computer systems, including homogeneous and heterogeneous architectures,
support shared memory in hardware. In a shared memory system, each of the processor …
support shared memory in hardware. In a shared memory system, each of the processor …
System-on-chip: Reuse and integration
Over the past ten years, as integrated circuits became increasingly more complex and
expensive, the industry began to embrace new design and reuse methodologies that are …
expensive, the industry began to embrace new design and reuse methodologies that are …
Transient-Execution Attacks: A Computer Architect Perspective
Computer architects employ a series of performance optimizations at the micro-architecture
level. These optimizations are meant to be invisible to the programmer but they are implicitly …
level. These optimizations are meant to be invisible to the programmer but they are implicitly …
DeNovo: Rethinking the memory hierarchy for disciplined parallelism
For parallelism to become tractable for mass programmers, shared-memory languages and
environments must evolve to enforce disciplined practices that ban" wild shared-memory …
environments must evolve to enforce disciplined practices that ban" wild shared-memory …
Cache coherence for GPU architectures
While scalable coherence has been extensively studied in the context of general purpose
chip multiprocessors (CMPs), GPU architectures present a new set of challenges …
chip multiprocessors (CMPs), GPU architectures present a new set of challenges …
Virtual circuit tree multicasting: A case for on-chip hardware multicast support
Current state-of-the-art on-chip networks provide efficiency, high throughput, and low latency
for one-to-one (unicast) traffic. The presence of one-to-many (multicast) or one-to-all …
for one-to-one (unicast) traffic. The presence of one-to-many (multicast) or one-to-all …