Benchmarking big data systems: A review

R Han, LK John, J Zhan - IEEE Transactions on Services …, 2017 - ieeexplore.ieee.org
With the fast development of big data systems in recent years, a variety of open-source
benchmarks have been built to evaluate and compare the workloads on these systems, and …

Speeding up distributed machine learning using codes

K Lee, M Lam, R Pedarsani… - IEEE Transactions …, 2017 - ieeexplore.ieee.org
Codes are widely used in many engineering applications to offer robustness against noise.
In large-scale systems, there are several types of noise that can affect the performance of …

Serving {DNNs} like clockwork: Performance predictability from the bottom up

A Gujarati, R Karimi, S Alzayat, W Hao… - … USENIX Symposium on …, 2020 - usenix.org
Machine learning inference is becoming a core building block for interactive web
applications. As a result, the underlying model serving systems on which these applications …

Efficient memory disaggregation with infiniswap

J Gu, Y Lee, Y Zhang, M Chowdhury… - 14th USENIX Symposium …, 2017 - usenix.org
Memory-intensive applications suffer large performance loss when their working sets do not
fully fit in memory. Yet, they cannot leverage otherwise unused remote memory when paging …

Cluster frameworks for efficient scheduling and resource allocation in data center networks: A survey

K Wang, Q Zhou, S Guo, J Luo - IEEE Communications Surveys …, 2018 - ieeexplore.ieee.org
Data centers are widely used for big data analytics, which often involve data-parallel jobs,
including query and web service. Meanwhile, cluster frameworks are rapidly developed for …

Quasar: Resource-efficient and qos-aware cluster management

C Delimitrou, C Kozyrakis - ACM Sigplan Notices, 2014 - dl.acm.org
Cloud computing promises flexibility and high performance for users and high cost-efficiency
for operators. Nevertheless, most cloud facilities operate at very low utilization, hurting both …

Making sense of performance in data analytics frameworks

K Ousterhout, R Rasti, S Ratnasamy… - … USENIX Symposium on …, 2015 - usenix.org
There has been much research devoted to improving the performance of data analytics
frameworks, but comparatively little effort has been spent systematically identifying the …

Low latency geo-distributed data analytics

Q Pu, G Ananthanarayanan, P Bodik… - ACM SIGCOMM …, 2015 - dl.acm.org
Low latency analytics on geographically distributed datasets (across datacenters, edge
clusters) is an upcoming and increasingly important challenge. The dominant approach of …

Sprocket: A serverless video processing framework

L Ao, L Izhikevich, GM Voelker, G Porter - Proceedings of the ACM …, 2018 - dl.acm.org
Sprocket is a highly configurable, stage-based, scalable, serverless video processing
framework that exploits intra-video parallelism to achieve low latency. Sprocket enables …

Coded computing for low-latency federated learning over wireless edge networks

S Prakash, S Dhakal, MR Akdeniz… - IEEE Journal on …, 2020 - ieeexplore.ieee.org
Federated learning enables training a global model from data located at the client nodes,
without data sharing and moving client data to a centralized server. Performance of …