A survey comparing specialized hardware and evolution in TPUs for neural networks

A Shahid, M Mushtaq - 2020 IEEE 23rd International Multitopic …, 2020 - ieeexplore.ieee.org
This survey paper is based on the evolution of TPUs from first generation TPUs to edge
TPUs and their architectures. This paper compares CPUs, GPUs, FPGAs and TPUs, their …

A highly efficient multi-core algorithm for clustering extremely large datasets

JM Kraus, HA Kestler - BMC bioinformatics, 2010 - Springer
Background In recent years, the demand for computational power in computational biology
has increased due to rapidly growing data sets from microarray and other high-throughput …

A parallel algorithm for solving the market basket analysis problem on Cell processors

КС Пан, МЛ Цымблер - Вестник Южно-Уральского …, 2010 - cyberleninka.ru
This paper considers a data mining problem: finding frequently occurring
itemsets. A parallel algorithm is proposed …

Hardware alternatives for the implementation of machine learning systems applied to image processing

AC Cob-Parro, I Bravo-Muñoz, A Gardel-Vicente… - researchgate.net
Due to the large amount of generated data for the new information technologies, it is
necessary to use specialised hardware to process this massive amount of information. For …

Self-Organizing Maps on the Cell Broadband Engine Architecture

SM McConnell - Journal of Physics: Conference Series, 2010 - iopscience.iop.org
We present and evaluate novel parallel implementations of Self-Organizing Maps for the
Cell Broadband Engine Architecture. Motivated by the interactive nature of the data-mining …

An Efficient Co-processing Framework for Large-Scale Scientific Applications

R Duan, RSM Goh, L Rachmawati… - 2014 IEEE 6th …, 2014 - ieeexplore.ieee.org
As scientific applications like Computational Fluid Dynamics (CFD) simulations generate
more and more data, co-processing becomes the most cost effective way to process the vast …

Investigation of a new cascade-of-resonators Σ-Δ converter configuration

Y Botteron, B Nowrouzian, ATG Fuller… - … Symposium on Circuits …, 1998 - ieeexplore.ieee.org
In the past, a number of bandpass Σ-Δ converter configurations have
appeared in the literature. Among these, there are two configurations which are of particular …

A parallel point matching algorithm for landmark based image registration using multicore platform

L Yang, L Gong, H Zhang, JL Nosher… - Euro-Par 2009 Parallel …, 2009 - Springer
Point matching is crucial for many computer vision applications. Establishing the
correspondence between a large number of data points is a computationally intensive …

On the Efficient Implementation of Reductions on the Cell Broadband Engine

A Strey - 2010 18th Euromicro Conference on Parallel …, 2010 - ieeexplore.ieee.org
For a high-performance parallel implementation of many scientific algorithms, efficient
realizations of combining communication patterns like reduce or all-reduce are important …

Interactive data mining on a CBEA cluster

S McConnell, D Patton, R Hurley, W Blight… - … Computing Systems and …, 2010 - Springer
We present implementations of two data-mining algorithms on a CELL processor, and on a
low-cost CBEA (CELL Broadband Engine Architecture) cluster using multiple PlayStation3 …