Big data analysis with signal processing on graphs: Representation and processing of massive data sets with irregular structure

A Sandryhaila, JMF Moura - IEEE signal processing magazine, 2014 - ieeexplore.ieee.org
Analysis and processing of very large data sets, or big data, poses a significant challenge.
Massive data sets are collected and studied in numerous domains, from engineering …

Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU

VW Lee, C Kim, J Chhugani, M Deisher, D Kim… - Proceedings of the 37th …, 2010 - dl.acm.org
Recent advances in computing have led to an explosion in the amount of data being
generated. Processing the ever-growing data in a timely manner has made throughput …

Processing array data on SIMD multi-core processor architectures

DG Carlson, TM Drucker, TJ Mullins… - US Patent …, 2013 - Google Patents
One embodiment of the invention includes a method for generating a SIMD data structure
tailored for processing fast Fourier transforms (FFTS) on a SIMD multi-core processor …

SPIRAL: Extreme performance portability

F Franchetti, TM Low, DT Popovici… - Proceedings of the …, 2018 - ieeexplore.ieee.org
In this paper, we address the question of how to automatically map computational kernels to
highly efficient code for a wide range of computing platforms and establish the correctness of …

Audio signal processing using graphics processing units

L Savioja, V Välimäki, JO Smith - Journal of the Audio Engineering Society, 2011 - aes.org
Current graphics processing units (GPUs) are massively parallel computing environments
offering remarkable performance boosts in parallelizable tasks. Audio signal processing is a …

SonicFFT: A system architecture for ultrasonic-based FFT acceleration

DA Patel, VP Bui, KTC Chai, A Lal… - 2022 27th Asia and …, 2022 - ieeexplore.ieee.org
Fast Fourier Transform (FFT) is an essential algorithm for numerous scientific and
engineering applications. It is key to implement FFT in a high-performance and energy …

MulticoreBSP for C: a high-performance library for shared-memory parallel programming

AN Yzelman, RH Bisseling, D Roose… - International Journal of …, 2014 - Springer
The bulk synchronous parallel (BSP) model, as well as parallel programming interfaces
based on BSP, classically target distributed-memory parallel architectures. In earlier work …

High-performance sparse fast Fourier transforms

J Schumacher, M Püschel - 2014 IEEE Workshop on Signal …, 2014 - ieeexplore.ieee.org
The sparse fast Fourier transform (SFFT) is a recent novel algorithm to compute discrete
Fourier transforms on signals with a sparse frequency domain with an improved asymptotic …