Performance interference of virtual machines: A survey

W Lin, C **ong, W Wu, F Shi, K Li, M Xu - ACM Computing Surveys, 2023 - dl.acm.org
The rapid development of cloud computing with virtualization technology has benefited both
academia and industry. For any cloud data center at scale, one of the primary challenges is …

{FIRM}: An intelligent fine-grained resource management framework for {SLO-Oriented} microservices

H Qiu, SS Banerjee, S Jha, ZT Kalbarczyk… - 14th USENIX symposium …, 2020 - usenix.org
User-facing latency-sensitive web services include numerous distributed,
intercommunicating microservices that promise to simplify software development and …

Parties: Qos-aware resource partitioning for multiple interactive services

S Chen, C Delimitrou, JF Martínez - Proceedings of the Twenty-Fourth …, 2019 - dl.acm.org
Multi-tenancy in modern datacenters is currently limited to a single latency-critical,
interactive service, running alongside one or more low-priority, best-effort jobs. This limits …

Seer: Leveraging big data to navigate the complexity of performance debugging in cloud microservices

Y Gan, Y Zhang, K Hu, D Cheng, Y He… - Proceedings of the …, 2019 - dl.acm.org
Performance unpredictability is a major roadblock towards cloud adoption, and has
performance, cost, and revenue ramifications. Predictable performance is even more critical …

Quasar: Resource-efficient and qos-aware cluster management

C Delimitrou, C Kozyrakis - ACM Sigplan Notices, 2014 - dl.acm.org
Cloud computing promises flexibility and high performance for users and high cost-efficiency
for operators. Nevertheless, most cloud facilities operate at very low utilization, hurting both …

Motion-appearance co-memory networks for video question answering

J Gao, R Ge, K Chen, R Nevatia - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Abstract Video Question Answering (QA) is an important task in understanding video
temporal structure. We observe that there are three unique attributes of video QA compared …

Reconciling high server utilization and sub-millisecond quality-of-service

J Leverich, C Kozyrakis - … of the Ninth European Conference on …, 2014 - dl.acm.org
The simplest strategy to guarantee good quality of service (QoS) for a latency-sensitive
workload with sub-millisecond latency in a shared cluster environment is to never run other …

Tarcil: Reconciling scheduling speed and quality in large shared clusters

C Delimitrou, D Sanchez, C Kozyrakis - … of the Sixth ACM Symposium on …, 2015 - dl.acm.org
Scheduling diverse applications in large, shared clusters is particularly challenging. Recent
research on cluster scheduling focuses either on scheduling speed, using sampling to …

Prophet: Precise qos prediction on non-preemptive accelerators to improve utilization in warehouse-scale computers

Q Chen, H Yang, M Guo, RS Kannan, J Mars… - Proceedings of the …, 2017 - dl.acm.org
Guaranteeing Quality-of-Service (QoS) of latency-sensitive applications while improving
server utilization through application co-location is important yet challenging in modern …

Twig: Multi-agent task management for colocated latency-critical cloud services

R Nishtala, V Petrucci, P Carpenter… - … Symposium on High …, 2020 - ieeexplore.ieee.org
Many of the important services running on data centres are latency-critical, time-varying, and
demand strict user satisfaction. Stringent tail-latency targets for colocated services and …