- Academic Search

DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication

Y Lu, W Liu - Proceedings of the International Conference for High …, 2023 - dl.acm.org

Sparse matrix-vector multiplication (SpMV) plays a key role in computational science and
engineering, graph processing, and machine learning applications. Much work on SpMV …

Salva Cita Citato da 10 Articoli correlati Tutte e 4 le versioni

Toward accelerated stencil computation by adapting tensor core unit on gpu

X Liu, Y Liu, H Yang, J Liao, M Li, Z Luan… - Proceedings of the 36th …, 2022 - dl.acm.org

The Tensor Core Unit (TCU) has been increasingly adopted on modern high performance
processors, specialized in boosting the performance of general matrix multiplication …

Salva Cita Citato da 21 Articoli correlati Tutte e 2 le versioni

[Free GPT-4]

[PDF] arxiv.org

Accelerating range minimum queries with ray tracing cores

E Meneses, CA Navarro, H Ferrada… - Future Generation …, 2024 - Elsevier

Over the past decade, GPU technology has undergone a notable transformation, evolving
from pure general-purpose computation to the integration of application-specific integrated …

Salva Cita Citato da 4 Articoli correlati Tutte e 3 le versioni

[Free GPT-4]

[PDF] arxiv.org

Modeling GPU Dynamic Parallelism for self similar density workloads

FA Quezada, CA Navarro, M Romero… - Future Generation …, 2023 - Elsevier

Dynamic Parallelism (DP) is a GPU programming abstraction that can make parallel
computation more efficient for problems that exhibit heterogeneous workloads. With DP …

Salva Cita Citato da 4 Articoli correlati Tutte e 5 le versioni

[Free GPT-4]

[PDF] acm.org

Bitmap-Based Sparse Matrix-Vector Multiplication with Tensor Cores

YA Chen, JX Yu - Proceedings of the 53rd International Conference on …, 2024 - dl.acm.org

Sparse matrix-vector multiplication (SpMV) plays a crucial role in various scientific and
engineering tasks. Thus, extensive research efforts are devoted to enhancing its …

Salva Cita Articoli correlati

[Free GPT-4]

[PDF] arxiv.org

A scalable and energy efficient GPU thread map for m-simplex domains

CA Navarro, FA Quezada, B Bustos, N Hitschfeld… - Future Generation …, 2023 - Elsevier

This work proposes a new GPU thread map for m-simplex domains that improves its
speedup along with the m-dimension and is energy efficient compared to other state of the …

Salva Cita Articoli correlati Tutte e 4 le versioni

[Free GPT-4]

[PDF] nsf.gov

TensorCV: Accelerating Inference-Adjacent Computation Using Tensor Processors

D Ha, WW Ro, HW Tseng - 2023 IEEE/ACM International …, 2023 - ieeexplore.ieee.org

The advancements in AI/ML accelerators have made the core AI/ML computation relatively
insignificant in application pipelines. For example, inferencing only accounts for 3% of the …

Salva Cita Articoli correlati Tutte e 2 le versioni

Crea avviso

Cita

Ricerca avanzata

Salvato in La mia biblioteca

Squeeze: Efficient compact fractals for tensor core gpus

DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication

Toward accelerated stencil computation by adapting tensor core unit on gpu

Accelerating range minimum queries with ray tracing cores

Modeling GPU Dynamic Parallelism for self similar density workloads

Bitmap-Based Sparse Matrix-Vector Multiplication with Tensor Cores

A scalable and energy efficient GPU thread map for m-simplex domains

TensorCV: Accelerating Inference-Adjacent Computation Using Tensor Processors