A novel data transformation and execution strategy for accelerating sparse matrix multiplication on GPUs

P Jiang, C Hong, G Agrawal - Proceedings of the 25th ACM SIGPLAN …, 2020 - dl.acm.org
SpMM (multiplication of a sparse matrix and a dense matrix) and SDDMM (sampled dense-
dense matrix multiplication) are at the core of many scientific, machine learning, and data …

FastZ: accelerating gapped whole genome alignment on GPUs

SC Gundabolu, TN Vijaykumar… - Proceedings of the …, 2021 - dl.acm.org
Recognizing the importance of whole genome alignment (WGA), the National Institutes for
Health maintains LASTZ, a sequential WGA application. As genomic data grows, there is a …

Generalized full sparse tiling of loop chains

CD Krieger - 2013 - search.proquest.com
Computer and computational scientists are tackling increasingly large and complex
problems and are seeking ways of improving the performance of their codes. The key issue …

Final Report for Award# DE-SC3956 Separating Algorithm and Implementation via programming Model Injection (SAIMI)

M Strout - 2015 - osti.gov
Programming parallel machines is fraught with difficulties: the obfuscation of algorithms due
to implementation details such as communication and synchronization, the need for …