A survey of methods for analyzing and improving GPU energy efficiency

S Mittal, JS Vetter - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Recent years have witnessed phenomenal growth in the computational capabilities and
applications of GPUs. However, this trend has also led to a dramatic increase in their power …

{MLaaS} in the wild: Workload analysis and scheduling in {Large-Scale} heterogeneous {GPU} clusters

Q Weng, W **ao, Y Yu, W Wang, C Wang, J He… - … USENIX Symposium on …, 2022 - usenix.org
With the sustained technological advances in machine learning (ML) and the availability of
massive datasets recently, tech companies are deploying large ML-as-a-Service (MLaaS) …

Deep learning workload scheduling in gpu datacenters: A survey

Z Ye, W Gao, Q Hu, P Sun, X Wang, Y Luo… - ACM Computing …, 2024 - dl.acm.org
Deep learning (DL) has demonstrated its remarkable success in a wide variety of fields. The
development of a DL model is a time-consuming and resource-intensive procedure. Hence …

[BOOK][B] Distributed and cloud computing: from parallel processing to the internet of things

K Hwang, J Dongarra, GC Fox - 2013 - books.google.com
Distributed and Cloud Computing: From Parallel Processing to the Internet of Things offers
complete coverage of modern distributed computing technology including clusters, the grid …

Context-aware sequential recommendation

Q Liu, S Wu, D Wang, Z Li… - 2016 IEEE 16th …, 2016 - ieeexplore.ieee.org
Since sequential information plays an important role in modeling user behaviors, various
sequential recommendation methods have been proposed. Methods based on Markov …

rCUDA: Reducing the number of GPU-based accelerators in high performance clusters

J Duato, AJ Pena, F Silla, R Mayo… - … Conference on High …, 2010 - ieeexplore.ieee.org
The increasing computing requirements for GPUs (Graphics Processing Units) have
favoured the design and marketing of commodity devices that nowadays can also be used to …

Transparent {GPU} sharing in container clouds for deep learning workloads

B Wu, Z Zhang, Z Bai, X Liu, X ** - 20th USENIX Symposium on …, 2023 - usenix.org
Containers are widely used for resource management in datacenters. A common practice to
support deep learning (DL) training in container clouds is to statically bind GPUs to …

A GPGPU transparent virtualization component for high performance computing clouds

G Giunta, R Montella, G Agrillo, G Coviello - Euro-Par 2010-Parallel …, 2010 - Springer
Abstract The GPU Virtualization Service (gVirtuS) presented in this work tries to fill the gap
between in-house hosted computing clusters, equipped with GPGPUs devices, and pay-for …

Telekine: Secure computing with cloud {GPUs}

T Hunt, Z Jia, V Miller, A Szekely, Y Hu… - … USENIX Symposium on …, 2020 - usenix.org
GPUs have become ubiquitous in the cloud due to the dramatic performance gains they
enable in domains such as machine learning and computer vision. However, offloading …

A survey on scheduling techniques in computing and network convergence

S Tang, Y Yu, H Wang, G Wang, W Chen… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
The computing demand for massive applications has led to the ubiquitous deployment of
computing power. This trend results in the urgent need for higher-level computing resource …