Performance interference of virtual machines: A survey
The rapid development of cloud computing with virtualization technology has benefited both
academia and industry. For any cloud data center at scale, one of the primary challenges is …
academia and industry. For any cloud data center at scale, one of the primary challenges is …
{FIRM}: An intelligent fine-grained resource management framework for {SLO-Oriented} microservices
User-facing latency-sensitive web services include numerous distributed,
intercommunicating microservices that promise to simplify software development and …
intercommunicating microservices that promise to simplify software development and …
Parties: Qos-aware resource partitioning for multiple interactive services
Multi-tenancy in modern datacenters is currently limited to a single latency-critical,
interactive service, running alongside one or more low-priority, best-effort jobs. This limits …
interactive service, running alongside one or more low-priority, best-effort jobs. This limits …
Seer: Leveraging big data to navigate the complexity of performance debugging in cloud microservices
Y Gan, Y Zhang, K Hu, D Cheng, Y He… - Proceedings of the …, 2019 - dl.acm.org
Performance unpredictability is a major roadblock towards cloud adoption, and has
performance, cost, and revenue ramifications. Predictable performance is even more critical …
performance, cost, and revenue ramifications. Predictable performance is even more critical …
Quasar: Resource-efficient and qos-aware cluster management
Cloud computing promises flexibility and high performance for users and high cost-efficiency
for operators. Nevertheless, most cloud facilities operate at very low utilization, hurting both …
for operators. Nevertheless, most cloud facilities operate at very low utilization, hurting both …
Motion-appearance co-memory networks for video question answering
Abstract Video Question Answering (QA) is an important task in understanding video
temporal structure. We observe that there are three unique attributes of video QA compared …
temporal structure. We observe that there are three unique attributes of video QA compared …
Reconciling high server utilization and sub-millisecond quality-of-service
The simplest strategy to guarantee good quality of service (QoS) for a latency-sensitive
workload with sub-millisecond latency in a shared cluster environment is to never run other …
workload with sub-millisecond latency in a shared cluster environment is to never run other …
Tarcil: Reconciling scheduling speed and quality in large shared clusters
Scheduling diverse applications in large, shared clusters is particularly challenging. Recent
research on cluster scheduling focuses either on scheduling speed, using sampling to …
research on cluster scheduling focuses either on scheduling speed, using sampling to …
Prophet: Precise qos prediction on non-preemptive accelerators to improve utilization in warehouse-scale computers
Guaranteeing Quality-of-Service (QoS) of latency-sensitive applications while improving
server utilization through application co-location is important yet challenging in modern …
server utilization through application co-location is important yet challenging in modern …
Twig: Multi-agent task management for colocated latency-critical cloud services
Many of the important services running on data centres are latency-critical, time-varying, and
demand strict user satisfaction. Stringent tail-latency targets for colocated services and …
demand strict user satisfaction. Stringent tail-latency targets for colocated services and …