Paving the way for NFV acceleration: A taxonomy, survey and future directions
As a recent innovation, network functions virtualization (NFV)—with its core concept of
replacing hardware middleboxes with software network functions (NFs) implemented in …
replacing hardware middleboxes with software network functions (NFs) implemented in …
Gslice: controlled spatial sharing of gpus for a scalable inference platform
The increasing demand for cloud-based inference services requires the use of Graphics
Processing Unit (GPU). It is highly desirable to utilize GPU efficiently by multiplexing different …
Processing Unit (GPU). It is highly desirable to utilize GPU efficiently by multiplexing different …
E3:{Energy-Efficient} microservices on {SmartNIC-Accelerated} servers
We investigate the use of SmartNIC-accelerated servers to execute microservice-based
applications in the data center. By offloading suitable microservices to the SmartNIC's low …
applications in the data center. By offloading suitable microservices to the SmartNIC's low …
Transparent {GPU} sharing in container clouds for deep learning workloads
Containers are widely used for resource management in datacenters. A common practice to
support deep learning (DL) training in container clouds is to statically bind GPUs to …
support deep learning (DL) training in container clouds is to statically bind GPUs to …
Heimdall: mobile GPU coordination platform for augmented reality applications
We present Heimdall, a mobile GPU coordination platform for emerging Augmented Reality
(AR) applications. Future AR apps impose an explored challenging workload: i) concurrent …
(AR) applications. Future AR apps impose an explored challenging workload: i) concurrent …
{NICA}: An infrastructure for inline acceleration of network applications
With rising network rates, cloud vendors increasingly deploy FPGA-based SmartNICs (F-
NICs), leveraging their inline processing capabilities to offload hypervisor networking …
NICs), leveraging their inline processing capabilities to offload hypervisor networking …
Salus: Fine-grained gpu sharing primitives for deep learning applications
GPU computing is becoming increasingly more popular with the proliferation of deep
learning (DL) applications. However, unlike traditional resources such as CPU or the …
learning (DL) applications. However, unlike traditional resources such as CPU or the …
PacketMill: toward per-Core 100-Gbps networking
We present PacketMill, a system for optimizing software packet processing, which (i)
introduces a new model to efficiently manage packet metadata and (ii) employs code …
introduces a new model to efficiently manage packet metadata and (ii) employs code …
Towards enhancing the reproducibility of deep learning bugs: an empirical study
Context Deep learning has achieved remarkable progress in various domains. However,
like any software system, deep learning systems contain bugs, some of which can have …
like any software system, deep learning systems contain bugs, some of which can have …
Fine-grained GPU sharing primitives for deep learning applications
Unlike traditional resources such as CPU or the network, modern GPUs do not natively
support fine-grained sharing primitives. Consequently, implementing common policies such …
support fine-grained sharing primitives. Consequently, implementing common policies such …