Taskstream: Accelerating task-parallel workloads by recovering program structure
Reconfigurable accelerators, like CGRAs and dataflow architectures, have come to
prominence for addressing data-processing problems. However, they are largely limited to …
prominence for addressing data-processing problems. However, they are largely limited to …
FLAASH: Flexible Accelerator Architecture for Sparse High-Order Tensor Contraction
G Kulp, A Ensinger, L Chen - arxiv preprint arxiv:2404.16317, 2024 - arxiv.org
Tensors play a vital role in machine learning (ML) and often exhibit properties best explored
while maintaining high-order. Efficiently performing ML computations requires taking …
while maintaining high-order. Efficiently performing ML computations requires taking …
[BOOK][B] Generalizing Programmable Accelerators for Irregularity
V Dadu - 2022 - search.proquest.com
Specialized accelerators are increasingly attractive solutions to continue expected
generational performance scaling with slowing technology scaling. Existing programmable …
generational performance scaling with slowing technology scaling. Existing programmable …