Tetris: Scalable and efficient neural network acceleration with 3d memory
The high accuracy of deep neural networks (NNs) has led to the development of NN
accelerators that improve performance by two orders of magnitude. However, scaling these …
accelerators that improve performance by two orders of magnitude. However, scaling these …
Accelergy: An architecture-level energy estimation methodology for accelerator designs
With Moore's law slowing down and Dennard scaling ended, energy-efficient domain-
specific accelerators, such as deep neural network (DNN) processors for machine learning …
specific accelerators, such as deep neural network (DNN) processors for machine learning …
The gem5 simulator
The gem5 simulation infrastructure is the merger of the best aspects of the M5 [4] and GEMS
[9] simulators. M5 provides a highly configurable simulation framework, multiple ISAs, and …
[9] simulators. M5 provides a highly configurable simulation framework, multiple ISAs, and …
McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures
This paper introduces McPAT, an integrated power, area, and timing modeling framework
that supports comprehensive design space exploration for multicore and manycore …
that supports comprehensive design space exploration for multicore and manycore …
Practical near-data processing for in-memory analytics frameworks
The end of Dennard scaling has made all systemsenergy-constrained. For data-intensive
applications with limitedtemporal locality, the major energy bottleneck is data …
applications with limitedtemporal locality, the major energy bottleneck is data …
DSENT-a tool connecting emerging photonics with electronics for opto-electronic networks-on-chip modeling
With the rise of many-core chips that require substantial bandwidth from the network on chip
(NoC), integrated photonic links have been investigated as a promising alternative to …
(NoC), integrated photonic links have been investigated as a promising alternative to …
HRL: Efficient and flexible reconfigurable logic for near-data processing
The energy constraints due to the end of Dennard scaling, the popularity of in-memory
analytics, and the advances in 3D integration technology have led to renewed interest in …
analytics, and the advances in 3D integration technology have led to renewed interest in …
The structural simulation toolkit
AF Rodrigues, KS Hemmert, BW Barrett… - ACM SIGMETRICS …, 2011 - dl.acm.org
As supercomputers grow, understanding their behavior and performance has become
increasingly challenging. New hurdles in scalability, programmability, power consumption …
increasingly challenging. New hurdles in scalability, programmability, power consumption …
The McPAT framework for multicore and manycore architectures: Simultaneously modeling power, area, and timing
This article introduces McPAT, an integrated power, area, and timing modeling framework
that supports comprehensive design space exploration for multicore and manycore …
that supports comprehensive design space exploration for multicore and manycore …
Orion 2.0: A power-area simulator for interconnection networks
As industry moves towards multicore chips, networks-on-chip (NoCs) are emerging as the
scalable fabric for interconnecting the cores. With power now the first-order design …
scalable fabric for interconnecting the cores. With power now the first-order design …