The gem5 simulator: Version 20.0+

J Lowe-Power, AM Ahmad, A Akram, M Alian… - arxiv preprint arxiv …, 2020 - arxiv.org
The open-source and community-supported gem5 simulator is one of the most popular tools
for computer architecture research. This simulation infrastructure allows researchers to …

Demystifying complex workload-dram interactions: An experimental study

S Ghose, T Li, N Ha**azar, DS Cali… - Proceedings of the ACM on …, 2019 - dl.acm.org
It has become increasingly difficult to understand the complex interactions between modern
applications and main memory, composed of Dynamic Random Access Memory (DRAM) …

sPIN: High-performance streaming Processing in the Network

T Hoefler, S Di Girolamo, K Taranov, RE Grant… - Proceedings of the …, 2017 - dl.acm.org
Optimizing communication performance is imperative for large-scale computing because
communication overheads limit the strong scalability of parallel applications. Today's …

Hardware-validated CPU performance and energy modelling

M Walker, S Bischoff, S Diestelhorst… - … Analysis of Systems …, 2018 - ieeexplore.ieee.org
Full-system simulation frameworks such as gem5 are used extensively to evaluate research
ideas and for design-space exploration. Moreover, energy-efficiency has become the key …

Full-system simulation of big. little multicore architecture for performance and energy exploration

A Butko, F Bruguier, A Gamatié… - 2016 IEEE 10th …, 2016 - ieeexplore.ieee.org
Single-ISA heterogeneous multicore processors have gained increasing popularity with the
introduction of recent technologies such as ARM big. LITTLE. These processors offer …

Micro-architectural simulation of embedded core heterogeneity with gem5 and mcpat

FA Endo, D Couroussé, HP Charles - … of the 2015 Workshop on Rapid …, 2015 - dl.acm.org
Energy consumption is the major factor limiting performance in embedded systems. In
addition, in the next generations of ICs, heat or energy constraints will not allow to power all …

Network-accelerated non-contiguous memory transfers

S Di Girolamo, K Taranov, A Kurth… - Proceedings of the …, 2019 - dl.acm.org
Applications often communicate data that is non-contiguous in the send-or the receive-
buffer, eg, when exchanging a column of a matrix stored in row-major order. While non …

[HTML][HTML] An experimental study of drift caused by partial shading using a modified DC-(P&O) technique for a stand-alone PV system

AK Singhal, NS Beniwal, R Beniwal, K Lalik - Energies, 2022 - mdpi.com
There is tremendous potential in solar energy to meet future electricity demands. Partial
shading (PS) and drift are two major problems that must be addressed simultaneously to …

Exploiting memory allocations in clusterised many‐core architectures

R Garibotti, L Ost, A Butko, R Reis… - IET Computers & …, 2019 - Wiley Online Library
Power‐efficient architectures have become the most important feature required for future
embedded systems. Modern designs, like those released on mobile devices, reveal that …

Evaluation of gem5 for performance modeling of ARM Cortex-R based embedded SoCs

I Wang, P Chakraborty, ZY Xue, YF Lin - Microprocessors and …, 2022 - Elsevier
ARM CPUs are prevalent in embedded systems ranging from low-power IoT to reasonably
high-powered mobile phones and devices. Embedded SoCs integrate a number of …