Data-aware task scheduling on multi-accelerator based platforms

C Augonnet, J Clet-Ortega, S Thibault… - 2010 IEEE 16th …, 2010 - ieeexplore.ieee.org
To fully tap into the potential of heterogeneous machines composed of multicore processors
and multiple accelerators, simple offloading approaches in which the main trunk of the …

An OpenCL framework for heterogeneous multicores with local memory

J Lee, J Kim, S Seo, S Kim, J Park, H Kim… - Proceedings of the 19th …, 2010 - dl.acm.org
In this paper, we present the design and implementation of an Open Computing Language
(OpenCL) framework that targets heterogeneous accelerator multicore architectures with …

[PDF][PDF] The case for heterogeneous HTAP

R Appuswamy, M Karpathiotakis… - … on Innovative Data …, 2017 - infoscience.epfl.ch
Modern database engines balance the demanding requirements of mixed, hybrid
transactional and analytical processing (HTAP) workloads by relying on i) global shared …

Automatic compilation of MATLAB programs for synergistic execution on heterogeneous processors

A Prasad, J Anantpur, R Govindarajan - Proceedings of the 32nd ACM …, 2011 - dl.acm.org
MATLAB is an array language, initially popular for rapid prototy**, but is now being
increasingly used to develop production code for numerical and scientific applications …

Hybrid caching techniques and garbage collection using hybrid caching techniques

CY Cher, MK Gschwind - US Patent 8,312,219, 2012 - Google Patents
Hybrid caching techniques and garbage collection using hybrid caching techniques are
provided. A determination of a measure of a characteristic of a data object is performed, the …

Design and implementation of software-managed caches for multicores with local memory

S Seo, J Lee, Z Sura - 2009 IEEE 15th International Symposium …, 2009 - ieeexplore.ieee.org
Heterogeneous multicores, such as Cell BE processors and GPGPUs, typically do not have
caches for their accelerator cores because coherence traffic, cache misses, and latencies …

Optimizing memory bandwidth exploitation for OpenVX applications on embedded many-core accelerators

G Tagliavini, G Haugou, A Marongiu… - Journal of Real-Time …, 2018 - Springer
In recent years, image processing has been a key application area for mobile and
embedded computing platforms. In this context, many-core accelerators are a viable solution …

Scheduling Tasks over Multicore machines enhanced with acelerators: a Runtime System's Perspective

C Augonnet - 2011 - theses.hal.science
Multicore machines equipped with accelerators are becoming increasingly popular in the
High Performance Computing ecosystem. Hybrid architectures provide significantly …

Optimizing memory bandwidth in OpenVX graph execution on embedded many-core accelerators

G Tagliavini, G Haugou, L Benini - Proceedings of the 2014 …, 2014 - ieeexplore.ieee.org
Computer vision and computational photography are hot applications areas for mobile and
embedded computing platforms. As a consequence, many-core accelerators are being …

Partitioning and Data Map** in Reconfigurable Cache and Scratchpad Memory--Based Architectures

P Chakraborty, PR Panda, S Sen - ACM Transactions on Design …, 2016 - dl.acm.org
Scratchpad memory (SPM) is considered a useful component in the memory hierarchy,
solely or along with caches, for meeting the power and energy constraints as performance …