Processing-in-memory: A workload-driven perspective
Many modern and emerging applications must process increasingly large volumes of data.
Unfortunately, prevalent computing paradigms are not designed to efficiently handle such …
Unfortunately, prevalent computing paradigms are not designed to efficiently handle such …
An efficient hardware supported and parallelization architecture for intelligent systems to overcome speculative overheads
In the last few decades, technology advancements have paved the way for the creation of
intelligent and autonomous systems that utilize complex calculations which are both time …
intelligent and autonomous systems that utilize complex calculations which are both time …
[CITATION][C] Using OpenMP: portable shared memory parallel programming
B Chapman - 2007 - books.google.com
A comprehensive overview of OpenMP, the standard application programming interface for
shared memory parallel computing—a reference for students and professionals." I hope that …
shared memory parallel computing—a reference for students and professionals." I hope that …
Learning from mistakes: a comprehensive study on real world concurrency bug characteristics
The reality of multi-core hardware has made concurrent programs pervasive. Unfortunately,
writing correct concurrent programs is difficult. Addressing this challenge requires advances …
writing correct concurrent programs is difficult. Addressing this challenge requires advances …
Transactional locking II
D Dice, O Shalev, N Shavit - International Symposium on Distributed …, 2006 - Springer
The transactional memory programming paradigm is gaining momentum as the approach of
choice for replacing locks in concurrent programming. This paper introduces the …
choice for replacing locks in concurrent programming. This paper introduces the …
Transactional memory: An overview
Writing applications that benefit from the massive computational power of future multicore
chip multiprocessors will not be an easy task for mainstream programmers accustomed to …
chip multiprocessors will not be an easy task for mainstream programmers accustomed to …
LogTM: Log-based transactional memory
KE Moore, J Bobba, MJ Moravan… - … Symposium on High …, 2006 - ieeexplore.ieee.org
Transactional memory (TM) simplifies parallel programming by guaranteeing that
transactions appear to execute atomically and in isolation. Implementing these properties …
transactions appear to execute atomically and in isolation. Implementing these properties …
Optimistic parallelism requires abstractions
Irregular applications, which manipulate large, pointer-based data structures like graphs, are
difficult to parallelize manually. Automatic tools and techniques such as restructuring …
difficult to parallelize manually. Automatic tools and techniques such as restructuring …
CoNDA: Efficient cache coherence support for near-data accelerators
Specialized on-chip accelerators are widely used to improve the energy efficiency of
computing systems. Recent advances in memory technology have enabled near-data …
computing systems. Recent advances in memory technology have enabled near-data …
AVIO: detecting atomicity violations via access interleaving invariants
Concurrency bugs are among the most difficult to test and diagnose of all software bugs. The
multicore technology trend worsens this problem. Most previous concurrency bug detection …
multicore technology trend worsens this problem. Most previous concurrency bug detection …