An evaluation of Edge TPU accelerators for convolutional neural networks

K Seshadri, B Akin, J Laudon… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Edge TPUs are a domain of accelerators for low-power, edge devices and are widely used
in various Google products such as Coral and Pixel devices. In this paper, we first discuss …

GRANITE: A graph neural network model for basic block throughput estimation

O Sýkora, PM Phothilimthana, C Mendis… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Analytical hardware performance models yield swift estimation of desired hardware
performance metrics. However, developing these analytical models for modern processors …

El Criterio de Información de Akaike en la Obtención de Modelos Estadísticos de Rendimiento

DR Martínez, J Albín, J Cabaleiro, T Pena… - … : XX Jornadas de …, 2009 - researchgate.net
This article presents a method for obtaining statistical performance models of
parallel applications, based on model selection using the criterion of …

Analytical estimation of the scalability of iterative numerical algorithms on distributed memory multiprocessors

LB Sokolinsky - Lobachevskii Journal of Mathematics, 2018 - Springer
This article presents a new high-level parallel computational model named BSF (Bulk
Synchronous Farm. The BSF model extends the BSP model to deal with the …

Accurate analytical performance model of communications in MPI applications

DR Martínez, JC Cabaleiro, TF Pena… - … on Parallel & …, 2009 - ieeexplore.ieee.org
This paper presents a new LogP-based model, called LoOgGP, which allows an accurate
characterization of MPI applications based on microbenchmark measurements. This new …

Parallel execution time prediction of the multitask parallel programs

R Wu, J Sun, J Chen - Performance Evaluation, 2008 - Elsevier
A critical problem of predicting the execution time of parallel programs is computing the
maximum execution time of tasks involved in the parallel computation. For a parallel …

Analytical performance models of parallel programs in clusters

DR Martínez, V Blanco, M Boullón… - Parallel Computing …, 2007 - books.google.com
This paper presents a framework based on a user driven methodology to obtain analytical
models on parallel systems and, in particular, clusters. This framework consists of two …

Towards automated construction of compiler optimizations

TCY Mendis - 2020 - dspace.mit.edu
First, we present goSLP, a framework that uses integer linear programming to find a globally
pairwise-optimal statement packing strategy to achieve superior vectorization performance …

Performance modeling of MPI applications using model selection techniques

DR Martínez, JC Cabaleiro, TF Pena… - 2010 18th Euromicro …, 2010 - ieeexplore.ieee.org
A new method for obtaining models of the performance of parallel applications based on
statistical analysis is presented in this paper. This method is based on the Akaike's …

Software tools for performance modeling of parallel programs

DR Martínez, V Blanco, M Boullón… - 2007 IEEE …, 2007 - ieeexplore.ieee.org
This paper presents a framework based on a user driven methodology to obtain analytical
models of MPI applications on parallel systems in a systematic and easy to use way. This …