Chip and package-scale interconnects for general-purpose, domain-specific and quantum computing systems-overview, challenges and opportunities

A Das, M Palesi, J Kim… - IEEE Journal on Emerging …, 2024 - ieeexplore.ieee.org
The anticipated end of Moore's law, coupled with the breakdown of Dennard scaling,
compelled everyone to conceive forthcoming computing systems once transistors reach their …

A survey on coarse-grained reconfigurable architectures from a performance perspective

A Podobas, K Sano, S Matsuoka - IEEE Access, 2020 - ieeexplore.ieee.org
With the end of both Dennard's scaling and Moore's law, computer users and researchers
are aggressively exploring alternative forms of computing in order to continue the …

On-chip interconnection architecture of the tile processor

D Wentzlaff, P Griffin, H Hoffmann, L Bao… - IEEE micro, 2007 - ieeexplore.ieee.org
IMesh, the tile processor architecture's on-chip interconnection network, connects the
multicore processor's tiles with five 2D mesh networks, each specialized for a different use …

Conservation cores: reducing the energy of mature computations

G Venkatesh, J Sampson, N Goulding, S Garcia… - ACM Sigplan …, 2010 - dl.acm.org
Growing transistor counts, limited power budgets, and the breakdown of voltage scaling are
currently conspiring to create a utilization wall that limits the fraction of a chip that can run at …

ORION 2.0: A fast and accurate NoC power and area model for early-stage design space exploration

AB Kahng, B Li, LS Peh… - 2009 Design, Automation & …, 2009 - ieeexplore.ieee.org
As industry moves towards many-core chips, networks-on-chip (NoCs) are emerging as the
scalable fabric for interconnecting the cores. With power now the first-order design …

Graphite: A distributed parallel simulator for multicores

JE Miller, H Kasture, G Kurian… - HPCA-16 2010 The …, 2010 - ieeexplore.ieee.org
This paper introduces the Graphite open-source distributed parallel multicore simulator
infrastructure. Graphite is designed from the ground up for exploration of future multi-core …

Corona: System implications of emerging nanophotonic technology

D Vantrease, R Schreiber, M Monchiero… - ACM SIGARCH …, 2008 - dl.acm.org
We expect that many-core microprocessors will push performance per chip from the 10
gigaflop to the 10 teraflop range in the coming decade. To support this increased …

A performance study of general-purpose applications on graphics processors using CUDA

S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer… - Journal of parallel and …, 2008 - Elsevier
Graphics processors (GPUs) provide a vast number of simple, data-parallel, deeply
multithreaded cores and high memory bandwidths. GPU architectures are becoming …

A case for bufferless routing in on-chip networks

T Moscibroda, O Mutlu - Proceedings of the 36th annual international …, 2009 - dl.acm.org
Buffers in on-chip networks consume significant energy, occupy chip area, and increase
design complexity. In this paper, we make a case for a new approach to designing on-chip …

Exploiting coarse-grained task, data, and pipeline parallelism in stream programs

MI Gordon, W Thies, S Amarasinghe - ACM SIGPLAN Notices, 2006 - dl.acm.org
As multicore architectures enter the mainstream, there is a pressing demand for high-level
programming models that can effectively map to them. Stream programming offers an …