Clearing the clouds: a study of emerging scale-out workloads on modern hardware

M Ferdman, A Adileh, O Kocberber, S Volos… - Acm sigplan …, 2012 - dl.acm.org
Emerging scale-out workloads require extensive amounts of computational resources.
However, data centers using modern server hardware face physical constraints in space …

[書籍][B] Memory systems: cache, DRAM, disk

B Jacob, D Wang, S Ng - 2010 - books.google.com
Is your memory hierarchy stop** your microprocessor from performing at the high level it
should be? Memory Systems: Cache, DRAM, Disk shows you how to resolve this problem …

Memory coherence in shared virtual memory systems

K Li, P Hudak - ACM Transactions on Computer Systems (TOCS), 1989 - dl.acm.org
The memory coherence problem in designing and implementing a shared virtual memory on
loosely coupled multiprocessors is studied in depth. Two classes of algorithms, centralized …

[PDF][PDF] DBMSs on a modern processor: Where does time go?

A Ailamaki, DJ DeWitt, MD Hill… - VLDB'99, Proceedings of …, 1999 - infoscience.epfl.ch
Recent high-performance processors employ sophisticated techniques to overlap and
simultaneously execute multiple computation and memory operations. Intuitively, these …

Reactive NUCA: near-optimal block placement and replication in distributed caches

N Hardavellas, M Ferdman, B Falsafi… - Proceedings of the 36th …, 2009 - dl.acm.org
Increases in on-chip communication delay and the large working sets of server and scientific
workloads complicate the design of the on-chip last-level cache for multicore processors …

Review and evaluation of commonly-implemented background subtraction algorithms

Y Benezeth, PM Jodoin, B Emile… - 2008 19th …, 2008 - ieeexplore.ieee.org
Locating moving objects in a video sequence is the first step of many computer vision
applications. Among the various motion-detection techniques, background subtraction …

Piranha: A scalable architecture based on single-chip multiprocessing

LA Barroso, K Gharachorloo, R McNamara… - ACM SIGARCH …, 2000 - dl.acm.org
The microprocessor industry is currently struggling with higher development costs and
longer design times that arise from exceedingly complex processors that are pushing the …

Spatial memory streaming

S Somogyi, TF Wenisch, A Ailamaki, B Falsafi… - ACM SIGARCH …, 2006 - dl.acm.org
Prior research indicates that there is much spatial variation in applications' memory access
patterns. Modern memory systems, however, use small fixed-size cache blocks and as such …

Evaluation of hardware data prefetchers on server processors

M Bakhshalipour, S Tabaeiaghdaei… - ACM Computing …, 2019 - dl.acm.org
Data prefetching, ie, the act of predicting an application's future memory accesses and
fetching those that are not in the on-chip caches, is a well-known and widely used approach …

The price of performance: An economic case for chip multiprocessing

LA Barroso - Queue, 2005 - dl.acm.org
In the late 1990s, our research group at DEC was one of a growing number of teams
advocating the CMP (chip multiprocessor) as an alternative to highly complex single …