A recursive algebraic coloring technique for hardware-efficient symmetric sparse matrix-vector multiplication
The symmetric sparse matrix-vector multiplication (SymmSpMV) is an important building
block for many numerical linear algebra kernel operations or graph traversal applications …
Memory-aware optimization for sequences of sparse matrix-vector multiplications
This paper presents a novel approach to optimize multiple invocations of a sparse matrix-
vector multiplication (SpMV) kernel performed on the same sparse matrix A and dense …
Efficiently Running SpMV on Multi-core DSPs for Banded Matrix
Sparse matrix-vector multiplication (SpMV) plays a pivotal role in large-scale scientific
computing. Despite the increasing use of low-power multicore digital signal processors …
Optimizing sparse matrix storage for the big data era
The efficient handling of sparse matrices is essential in many applications. In particular, they
are critical in Big Data applications that involve large graphs, as these are often represented …
A dynamic approach for workload partitioning on GPU architectures
F Busato, N Bombieri - IEEE Transactions on Parallel and …, 2016 - ieeexplore.ieee.org
Workload partitioning and the subsequent work item-to-thread mapping are key aspects to
face when implementing any efficient GPU application. Different techniques have been …
High-Performance and Power-Aware Graph Processing on GPUs
F Busato - 2018 - iris.univr.it
Graphs are a common representation in many problem domains, including engineering,
finance, medicine, and scientific applications. Different problems map to very large graphs …
Optimization of the computational performance of the finite element method on the CUDA architecture
A Dziekonski - 2015 - pbc.gda.pl
High accuracy in the design process of complex microwave circuits
used in wireless communication (e.g. antennas, filters, couplers) can be …