Benchmarking big data systems: A review
With the fast development of big data systems in recent years, a variety of open-source
benchmarks have been built to evaluate and compare the workloads on these systems, and …
benchmarks have been built to evaluate and compare the workloads on these systems, and …
Speeding up distributed machine learning using codes
Codes are widely used in many engineering applications to offer robustness against noise.
In large-scale systems, there are several types of noise that can affect the performance of …
In large-scale systems, there are several types of noise that can affect the performance of …
Serving {DNNs} like clockwork: Performance predictability from the bottom up
Machine learning inference is becoming a core building block for interactive web
applications. As a result, the underlying model serving systems on which these applications …
applications. As a result, the underlying model serving systems on which these applications …
Efficient memory disaggregation with infiniswap
Memory-intensive applications suffer large performance loss when their working sets do not
fully fit in memory. Yet, they cannot leverage otherwise unused remote memory when paging …
fully fit in memory. Yet, they cannot leverage otherwise unused remote memory when paging …
Cluster frameworks for efficient scheduling and resource allocation in data center networks: A survey
Data centers are widely used for big data analytics, which often involve data-parallel jobs,
including query and web service. Meanwhile, cluster frameworks are rapidly developed for …
including query and web service. Meanwhile, cluster frameworks are rapidly developed for …
Quasar: Resource-efficient and qos-aware cluster management
Cloud computing promises flexibility and high performance for users and high cost-efficiency
for operators. Nevertheless, most cloud facilities operate at very low utilization, hurting both …
for operators. Nevertheless, most cloud facilities operate at very low utilization, hurting both …
Making sense of performance in data analytics frameworks
There has been much research devoted to improving the performance of data analytics
frameworks, but comparatively little effort has been spent systematically identifying the …
frameworks, but comparatively little effort has been spent systematically identifying the …
Low latency geo-distributed data analytics
Low latency analytics on geographically distributed datasets (across datacenters, edge
clusters) is an upcoming and increasingly important challenge. The dominant approach of …
clusters) is an upcoming and increasingly important challenge. The dominant approach of …
Sprocket: A serverless video processing framework
Sprocket is a highly configurable, stage-based, scalable, serverless video processing
framework that exploits intra-video parallelism to achieve low latency. Sprocket enables …
framework that exploits intra-video parallelism to achieve low latency. Sprocket enables …
Coded computing for low-latency federated learning over wireless edge networks
Federated learning enables training a global model from data located at the client nodes,
without data sharing and moving client data to a centralized server. Performance of …
without data sharing and moving client data to a centralized server. Performance of …