Mystique: Enabling accurate and scalable generation of production AI benchmarks

M Liang, W Fu, L Feng, Z Lin, P Panakanti… - Proceedings of the 50th …, 2023 - dl.acm.org
Building large AI fleets to support the rapidly growing DL workloads is an active research
topic for modern cloud providers. Generating accurate benchmarks plays an essential role in …

Optimizing resource management for shared microservices: a scalable system design

S Luo, C Lin, K Ye, G Xu, L Zhang, G Yang… - ACM Transactions on …, 2024 - dl.acm.org
A common approach to improving resource utilization in data centers is to adaptively
provision resources based on the actual workload. One fundamental challenge of doing this …

TraceUpscaler: Upscaling Traces to Evaluate Systems at High Load

SM Sajal, T Zhu, B Urgaonkar, S Sen - Proceedings of the Nineteenth …, 2024 - dl.acm.org
Trace replay is a common approach for evaluating systems by rerunning historical traffic
patterns, but it is not always possible to find suitable real-world traces at the desired level of …

Derm: SLA-aware Resource Management for Highly Dynamic Microservices

L Chen, S Luo, C Lin, Z Mo, H Xu… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Ensuring efficient resource allocation while providing service level agreement (SLA)
guarantees for end-to-end (E2E) latency is crucial for microservice applications. Although …

DNSScope: Fine-Grained DNS Cache Probing for Remote Network Activity Characterization

J Li, Z Lin, X Ma, J Li, J Qu, X Luo… - IEEE INFOCOM 2024 …, 2024 - ieeexplore.ieee.org
The domain name system (DNS) is indispensable to nearly every Internet service. It has
been extensively utilized for network activity characterization in passive and active …

End-to-End Cloud Application Cloning With Ditto

M Liang, Y Gan, Y Li, C Torres, A Dhanotia… - IEEE Micro, 2024 - ieeexplore.ieee.org
The lack of publicly available cloud services has been a recurring problem in architecture
and systems. Although open source benchmarks exist, they do not capture the complexity of …

Exploring Imbalances among Microservice Containers in Large Cloud Platforms

C Lin, S Luo, H Xu - 2023 IEEE Intl Conf on Parallel & …, 2023 - ieeexplore.ieee.org
As software systems evolve, monolithic applications are often transformed into a collection of
lightweight and loosely-coupled microservices. Understanding the intricacies of …

Improving the Fidelity of Trace-Driven Experiments in Cloud Computing Systems

SM Sajal - 2024 - search.proquest.com
Realistic experimentation is an important part of research in computer systems and of
prototyping of new features and ideas in the industry. The ideal way to evaluate is to use real …