GPU virtualization and scheduling methods: A comprehensive survey

CH Hong, I Spence, DS Nikolopoulos - ACM Computing Surveys (CSUR …, 2017 - dl.acm.org
The integration of graphics processing units (GPUs) on high-end compute nodes has
established a new accelerator-based heterogeneous computing model, which now …

DRMaestro: orchestrating disaggregated resources on virtualized data-centers

M Amaral, J Polo, D Carrera, N Gonzalez… - Journal of cloud …, 2021 - Springer
Modern applications demand resources at an unprecedented level. In this sense, data-
centers are required to scale efficiently to cope with such demand. Resource disaggregation …

MultiCL: Enabling automatic scheduling for task-parallel workloads in OpenCL

AM Aji, AJ Peña, P Balaji, W Feng - Parallel Computing, 2016 - Elsevier
The OpenCL specification tightly binds a command queue to a specific device. For best
performance, the user has to find the ideal queue-device map** at command queue …

Automatic command queue scheduling for task-parallel workloads in opencl

AM Aji, AJ Pena, P Balaji… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
OpenCL is a portable interface that can be used to program cluster nodes with
heterogeneous compute devices. The OpenCL specification tightly binds its workflow …

VOCL-FT: introducing techniques for efficient soft error coprocessor recovery

AJ Peña, W Bland, P Balaji - … of the International Conference for High …, 2015 - dl.acm.org
Popular accelerator programming models rely on offloading computation operations and
their corresponding data transfers to the coprocessors, leveraging synchronization points …

gVMP: A multi-objective joint VM and vGPU placement heuristic for API remoting-based GPU virtualization and disaggregation in cloud data centers

A Siavashi, M Momtazpour - Journal of Parallel and Distributed Computing, 2023 - Elsevier
The diverse needs of customers drive cloud providers to incorporate more GPU-enabled
services. It is known that users barely utilize GPUs. Hence, GPU virtualization techniques …

Autonomic performance and power control on virtualized servers: Survey, practices, and trends

X Zhou, CJ Jiang - Journal of Computer Science and Technology, 2014 - Springer
Modern datacenter servers hosting popular Internet services face significant and multi-facet
challenges in performance and power control. The user-perceived performance is the result …

A flexible scheduling framework for heterogeneous CPU-GPU clusters

K Sajjapongse, T Agarwal… - 2014 21st International …, 2014 - ieeexplore.ieee.org
In the last few years, thanks to their computational power and progressively increased
programmability, GPUs have become part of HPC clusters. As a result, widely used open …

CRAC: An automatic assistant compiler of checkpoint/restart for OpenCL program

G Chen, J Zhang, Z Zhu, H Wang… - Concurrency and …, 2022 - Wiley Online Library
Nowadays, people use multiple devices to meet the growing requirement for computing.
With the application of multicard computing, fault tolerance, load balance, and resource …

Device-sensitive framework for handling heterogeneous asymmetric clusters efficiently

V Raca, E Mehofer - 2015 27th International Symposium on …, 2015 - ieeexplore.ieee.org
Heterogeneous systems with different types of compute devices are common nowadays in
the field of High Performance Computing (HPC). This heterogeneity is not limited to compute …