A survey of methods for analyzing and improving GPU energy efficiency
Recent years have witnessed phenomenal growth in the computational capabilities and
applications of GPUs. However, this trend has also led to a dramatic increase in their power …
applications of GPUs. However, this trend has also led to a dramatic increase in their power …
{MLaaS} in the wild: Workload analysis and scheduling in {Large-Scale} heterogeneous {GPU} clusters
With the sustained technological advances in machine learning (ML) and the availability of
massive datasets recently, tech companies are deploying large ML-as-a-Service (MLaaS) …
massive datasets recently, tech companies are deploying large ML-as-a-Service (MLaaS) …
Deep learning workload scheduling in gpu datacenters: A survey
Deep learning (DL) has demonstrated its remarkable success in a wide variety of fields. The
development of a DL model is a time-consuming and resource-intensive procedure. Hence …
development of a DL model is a time-consuming and resource-intensive procedure. Hence …
[BOOK][B] Distributed and cloud computing: from parallel processing to the internet of things
Distributed and Cloud Computing: From Parallel Processing to the Internet of Things offers
complete coverage of modern distributed computing technology including clusters, the grid …
complete coverage of modern distributed computing technology including clusters, the grid …
Context-aware sequential recommendation
Since sequential information plays an important role in modeling user behaviors, various
sequential recommendation methods have been proposed. Methods based on Markov …
sequential recommendation methods have been proposed. Methods based on Markov …
rCUDA: Reducing the number of GPU-based accelerators in high performance clusters
The increasing computing requirements for GPUs (Graphics Processing Units) have
favoured the design and marketing of commodity devices that nowadays can also be used to …
favoured the design and marketing of commodity devices that nowadays can also be used to …
Transparent {GPU} sharing in container clouds for deep learning workloads
Containers are widely used for resource management in datacenters. A common practice to
support deep learning (DL) training in container clouds is to statically bind GPUs to …
support deep learning (DL) training in container clouds is to statically bind GPUs to …
A GPGPU transparent virtualization component for high performance computing clouds
Abstract The GPU Virtualization Service (gVirtuS) presented in this work tries to fill the gap
between in-house hosted computing clusters, equipped with GPGPUs devices, and pay-for …
between in-house hosted computing clusters, equipped with GPGPUs devices, and pay-for …
Telekine: Secure computing with cloud {GPUs}
GPUs have become ubiquitous in the cloud due to the dramatic performance gains they
enable in domains such as machine learning and computer vision. However, offloading …
enable in domains such as machine learning and computer vision. However, offloading …
A survey on scheduling techniques in computing and network convergence
S Tang, Y Yu, H Wang, G Wang, W Chen… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
The computing demand for massive applications has led to the ubiquitous deployment of
computing power. This trend results in the urgent need for higher-level computing resource …
computing power. This trend results in the urgent need for higher-level computing resource …