DeNovo: Rethinking the memory hierarchy for disciplined parallelism

B Choi, R Komuravelli, H Sung… - 2011 International …, 2011 - ieeexplore.ieee.org
For parallelism to become tractable for mass programmers, shared-memory languages and
environments must evolve to enforce disciplined practices that ban" wild shared-memory …

SynFull: Synthetic traffic models capturing cache coherent behaviour

M Badr, NE Jerger - ACM SIGARCH Computer Architecture News, 2014 - dl.acm.org
Modern and future many-core systems represent complex architectures. The communication
fabrics of these large systems heavily influence their performance and power consumption …

Cuckoo directory: A scalable directory for many-core systems

M Ferdman, P Lotfi-Kamran, K Balet… - 2011 IEEE 17th …, 2011 - ieeexplore.ieee.org
Growing core counts have highlighted the need for scalable on-chip coherence
mechanisms. The increase in the number of on-chip cores exposes the energy and area …

SCD: A scalable coherence directory with flexible sharer set encoding

D Sanchez, C Kozyrakis - IEEE International Symposium on …, 2012 - ieeexplore.ieee.org
Large-scale CMPs with hundreds of cores require a directory-based protocol to maintain
cache coherence. However, previously proposed coherence directories are hard to scale …

The bunker cache for spatio-value approximation

J San Miguel, J Albericio, NE Jerger… - 2016 49th Annual IEEE …, 2016 - ieeexplore.ieee.org
The cost of moving and storing data is still a fundamental concern for computer architects.
Inefficient handling of data can be attributed to conventional architectures being oblivious to …

SPACE: Sharing pattern-based directory coherence for multicore scalability

H Zhao, A Shriraman, S Dwarkadas - Proceedings of the 19th …, 2010 - dl.acm.org
An important challenge in multicore processors is the maintenance of cache coherence in a
scalable manner. Directory-based protocols save bandwidth and achieve scalability by …

Selective GPU caches to eliminate CPU-GPU HW cache coherence

N Agarwal, D Nellans, E Ebrahimi… - … Symposium on High …, 2016 - ieeexplore.ieee.org
Cache coherence is ubiquitous in shared memory multiprocessors because it provides a
simple, high performance memory abstraction to programmers. Recent work suggests …

Spatiotemporal coherence tracking

M Alisafaee - 2012 45th Annual IEEE/ACM International …, 2012 - ieeexplore.ieee.org
Chip-multiprocessors require a coherence directory to track data sharing and order
accesses to the shared data. Scaling coherence directories to support a large number of …

CCNoC: Specializing on-chip interconnects for energy efficiency in cache-coherent servers

S Volos, C Seiculescu, B Grot, NK Pour… - 2012 IEEE/ACM …, 2012 - ieeexplore.ieee.org
Many core chips are emerging as the architecture of choice to provide power efficiency and
improve performance, while riding Moore's Law. In these architectures, on-chip inter …

Coherence domain restriction on large scale systems

Y Fu, TM Nguyen, D Wentzlaff - … of the 48th International Symposium on …, 2015 - dl.acm.org
Designing massive scale cache coherence systems has been an elusive goal. Whether it be
on large-scale GPUs, future thousand-core chips, or across million-core warehouse scale …