Enabling Highly Efficient k-Means Computations on the SW26010 Many-Core Processor of Sunway TaihuLight

M Li, C Yang, Q Sun, WJ Ma, WL Cao, YL Ao - Journal of Computer …, 2019 - Springer
With the advent of the big data era, the amounts of sampling data and the dimensions of
data features are rapidly growing. It is highly desired to enable fast and efficient clustering of …

Parallel Framework for Dimensionality Reduction of Large‐Scale Datasets

SK Samudrala, J Zola, S Aluru… - Scientific …, 2015 - Wiley Online Library
Dimensionality reduction refers to a set of mathematical techniques used to reduce
complexity of the original high‐dimensional data, while preserving its selected properties …

Computing large-scale distance matrices on GPU

AS Arefin, C Riveros, R Berretta… - 2012 7th International …, 2012 - ieeexplore.ieee.org
A distance matrix is simply an n × n two-dimensional array that contains pairwise distances of
a set of n points in a metric space. It has a wide range of usage in several fields of scientific …

All-pairs computations on many-core graphics processors

A Sarje, S Aluru - Parallel Computing, 2013 - Elsevier
Developing high-performance applications on emerging multi- and many-core architectures
requires efficient mapping techniques and architecture-specific tuning methodologies to …

Taming DNA clustering in massive datasets with SLYMFAST

M Belcaid, C Arisdakessian, Y Kravchenko - ACM SIGAPP Applied …, 2022 - dl.acm.org
Data from sequencing instruments are produced at such rates that their analysis is
becoming increasingly computationally challenging. Although DNA sequence clustering of …

A high performance algorithm for clustering of large-scale protein mass spectrometry data using multi-core architectures

F Saeed, JD Hoffert, MA Knepper - Proceedings of the 2013 IEEE/ACM …, 2013 - dl.acm.org
High-throughput mass spectrometers can produce thousands of peptide spectra from a
single complex protein sample in a short amount of time. These data sets contain a …

Exploiting thread-level and instruction-level parallelism to cluster mass spectrometry data using multicore architectures

F Saeed, JD Hoffert, T Pisitkun, MA Knepper - Network Modeling Analysis …, 2014 - Springer
Modern mass spectrometers can produce large numbers of peptide spectra from complex
biological samples in a short time. A substantial amount of redundancy is observed in these …

Efficient DNA sequence partitioning using probabilistic subsets and hypergraphs

M Belcaid, C Arisdakessian, Y Kravchenko - Proceedings of the 36th …, 2021 - dl.acm.org
Sequence clustering is an important computational step in numerous bioinformatics
applications such as high-throughput immune system characterization, marker-based …

Parallelizing complex streaming applications on distributed scratchpad memory multicore architecture

SK Chen, CY Hung, CC Chen, CW Liu - International Journal of Parallel …, 2014 - Springer
Multicore processors can provide sufficient computing power and flexibility for complex
streaming applications, such as high-definition video processing. For less hardware …

Parallel applications employing pairwise computations on emerging architectures

A Sarje, S Aluru - … IEEE International Symposium on Parallel & …, 2010 - ieeexplore.ieee.org
Today's emerging architectures have higher levels of parallelism incorporated within a
processor. They require efficient strategies to extract the performance they have to offer. In …