Serverless computing on heterogeneous computers
Existing serverless computing platforms are built upon homogeneous computers, limiting the
function density and restricting serverless computing to limited scenarios. We introduce …
Eleos: ExitLess OS services for SGX enclaves
Intel Software Guard Extensions (SGX) enable secure and trusted execution of user code in
an isolated enclave to protect against a powerful adversary. Unfortunately, running I/O …
LITE kernel RDMA support for datacenter applications
Recently, there is an increasing interest in building data-center applications with RDMA
because of its low-latency, high-throughput, and low-CPU-utilization benefits. However …
High performance packet processing with FlexNIC
The recent surge of network I/O performance has put enormous pressure on memory and
software I/O processing subsystems. We argue that the primary reason for high memory and …
NICA: An infrastructure for inline acceleration of network applications
With rising network rates, cloud vendors increasingly deploy FPGA-based SmartNICs (F-NICs),
leveraging their inline processing capabilities to offload hypervisor networking …
Lynx: A SmartNIC-driven accelerator-centric architecture for network servers
This paper explores new opportunities afforded by the growing deployment of compute and
I/O accelerators to improve the performance and efficiency of hardware-accelerated …
Telekine: Secure computing with cloud GPUs
GPUs have become ubiquitous in the cloud due to the dramatic performance gains they
enable in domains such as machine learning and computer vision. However, offloading …
G10: Enabling an efficient unified GPU memory and storage architecture with smart tensor migrations
To break the GPU memory wall for scaling deep learning workloads, a variety of architecture
and system techniques have been proposed recently. Their typical approaches include …
DeepMatch: Practical deep packet inspection in the data plane using network processors
Restricting data plane processing to packet headers precludes analysis of payloads to
improve routing and security decisions. DeepMatch delivers line-rate regular expression …
[BOOK][B] General-purpose graphics processor architectures
Originally developed to support video games, graphics processor units (GPUs) are now
increasingly used for general-purpose (non-graphics) applications ranging from machine …