A recursive algebraic coloring technique for hardware-efficient symmetric sparse matrix-vector multiplication

C Alappat, A Basermann, AR Bishop… - ACM Transactions on …, 2020 - dl.acm.org
The symmetric sparse matrix-vector multiplication (SymmSpMV) is an important building
block for many numerical linear algebra kernel operations or graph traversal applications …

Memory-aware optimization for sequences of sparse matrix-vector multiplications

Y Zhang, S Li, F Yuan, D Dong, X Yang… - 2023 IEEE …, 2023 - ieeexplore.ieee.org
This paper presents a novel approach to optimize multiple invocations of a sparse matrix-
vector multiplication (SpMV) kernel performed on the same sparse matrix A and dense …

Efficiently Running SpMV on Multi-core DSPs for Banded Matrix

D Bi, S Li, Y Zhang, X Yang, D Dong - International Conference on …, 2023 - Springer
Sparse matrix-vector multiplication (SpMV) plays a pivotal role in large-scale scientific
computing. Despite the increasing use of low-power multicore digital signal processors …

Optimizing sparse matrix storage for the big data era

R Marichal, E Dufrechou, P Ezzatti - … , Big Data & Emerging Topics: 9th …, 2021 - Springer
The efficient handling of sparse matrices is essential in many applications. In particular, they
are critical in Big Data applications that involve large graphs, as these are often represented …

A dynamic approach for workload partitioning on GPU architectures

F Busato, N Bombieri - IEEE Transactions on Parallel and …, 2016 - ieeexplore.ieee.org
Workload partitioning and the subsequent work item-to-thread map** are key aspects to
face when implementing any efficient GPU application. Different techniques have been …

High-Performance and Power-Aware Graph Processing on GPUs

F Busato - 2018 - iris.univr.it
Graphs are a common representation in many problem domains, including engineering,
finance, medicine, and scientific applications. Different problems map to very large graphs …

[PDF][PDF] Optymalizacja wydajności obliczeniowej metody elementów skończonych w architekturze CUDA

A Dziekonski - 2015 - pbc.gda.pl
Dużą dokładność w procesie projektowania złożonych układów mikrofalowych
wykorzystywanych w komunikacji bezprzewodowej (np. anteny, filtry, sprzęgacze) można …