Invasive computing: An overview

J Teich, J Henkel, A Herkersdorf… - … system-on-chip …, 2011 - Springer
A novel paradigm for designing and programming future parallel computing systems called
invasive computing is proposed. The main idea and novelty of invasive computing is to …

Parallel programming models for heterogeneous many-cores: a comprehensive survey

J Fang, C Huang, T Tang, Z Wang - CCF Transactions on High …, 2020 - Springer
Heterogeneous many-cores are now an integral part of modern computing systems ranging
from embedding systems to supercomputers. While heterogeneous many-core design offers …

The ARM scalable vector extension

N Stephens, S Biles, M Boettcher, J Eapen… - IEEE micro, 2017 - ieeexplore.ieee.org
This article describes the ARM Scalable Vector Extension (SVE). Several goals guided the
design of the architecture. First was the need to extend the vector processing capability …

The future of microprocessors

S Borkar, AA Chien - Communications of the ACM, 2011 - dl.acm.org
The future of microprocessors Page 1 MAy 2011 | vOl. 54 | nO. 5 | CommunICatIons of the aCm
67 MICroProCessors—sInGLe-ChIP CoMPUters—are the building blocks of the information …

Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU

VW Lee, C Kim, J Chhugani, M Deisher, D Kim… - Proceedings of the 37th …, 2010 - dl.acm.org
Recent advances in computing have led to an explosion in the amount of data being
generated. Processing the ever-growing data in a timely manner has made throughput …

[КНИГА][B] Real-time rendering

T Akenine-Moller, E Haines, N Hoffman - 2019 - taylorfrancis.com
Thoroughly revised, this third edition focuses on modern techniques used to generate
synthetic three-dimensional images in a fraction of a second. With the advent of …

The multikernel: a new OS architecture for scalable multicore systems

A Baumann, P Barham, PE Dagand, T Harris… - Proceedings of the …, 2009 - dl.acm.org
Commodity computer systems contain more and more processor cores and exhibit
increasingly diverse architectural tradeoffs, including memory hierarchies, interconnects …

PacketShader: a GPU-accelerated software router

S Han, K Jang, KS Park, S Moon - ACM SIGCOMM Computer …, 2010 - dl.acm.org
We present PacketShader, a high-performance software router framework for general packet
processing with Graphics Processing Unit (GPU) acceleration. PacketShader exploits the …

PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation

A Klöckner, N Pinto, Y Lee, B Catanzaro, P Ivanov… - Parallel Computing, 2012 - Elsevier
High-performance computing has recently seen a surge of interest in heterogeneous
systems, with an emphasis on modern Graphics Processing Units (GPUs). These devices …

OpenPiton: An open source manycore research framework

J Balkind, M McKeown, Y Fu, T Nguyen, Y Zhou… - ACM SIGPLAN …, 2016 - dl.acm.org
Industry is building larger, more complex, manycore processors on the back of strong
institutional knowledge, but academic projects face difficulties in replicating that scale. To …