No one (cluster) size fits all: automatic cluster sizing for data-intensive analytics
Infrastructure-as-a-Service (IaaS) cloud platforms have brought two unprecedented changes
to cluster provisioning practices. First, any (nonexpert) user can provision a cluster of any …
to cluster provisioning practices. First, any (nonexpert) user can provision a cluster of any …
TRACON: Interference-aware scheduling for data-intensive applications in virtualized environments
Large-scale data centers leverage virtualization technology to achieve excellent resource
utilization, scalability, and high availability. Ideally, the performance of an application …
utilization, scalability, and high availability. Ideally, the performance of an application …
Internal parallelism of flash memory-based solid-state drives
A unique merit of a solid-state drive (SSD) is its internal parallelism. In this article, we
present a set of comprehensive studies on understanding and exploiting internal parallelism …
present a set of comprehensive studies on understanding and exploiting internal parallelism …
Differentiated storage services
We propose an I/O classification architecture to close the widening semantic gap between
computer systems and storage systems. By classifying I/O, a computer system can request …
computer systems and storage systems. By classifying I/O, a computer system can request …
Distributed storage resource scheduler and load balancer
A Gulati, I Ahmad, C Kumar - US Patent 8,914,598, 2014 - Google Patents
Distributed storage resources are managed based on data collected from online monitoring
of workloads on the storage resources and performance characteristics of the storage …
of workloads on the storage resources and performance characteristics of the storage …
Online anomaly prediction for robust cluster systems
In this paper, we present a stream-based mining algorithm for online anomaly prediction.
Many real-world applications such as data stream analysis requires continuous cluster …
Many real-world applications such as data stream analysis requires continuous cluster …
[PDF][PDF] BASIL: Automated IO Load Balancing Across Storage Devices.
A Gulati, C Kumar, I Ahmad, K Kumar - Fast, 2010 - usenix.org
Live migration of virtual hard disks between storage arrays has long been possible.
However, there is a dearth of online tools to perform automated virtual disk placement and …
However, there is a dearth of online tools to perform automated virtual disk placement and …
Pesto: online storage performance management in virtualized datacenters
A Gulati, G Shanmuganathan, I Ahmad… - Proceedings of the 2nd …, 2011 - dl.acm.org
Virtualized datacenters strive to reduce costs through workload consolidation. Workloads
exhibit a diverse set of IO behaviors and varying IO load that makes it difficult to estimate the …
exhibit a diverse set of IO behaviors and varying IO load that makes it difficult to estimate the …
Verifiable resource accounting for cloud computing services
Cloud computing offers users the potential to reduce operating and capital expenses by
leveraging the amortization benefits offered by large, managed infrastructures. However, the …
leveraging the amortization benefits offered by large, managed infrastructures. However, the …
Ironmodel: Robust performance models in the wild
E Thereska, GR Ganger - ACM SIGMETRICS Performance Evaluation …, 2008 - dl.acm.org
Traditional performance models are too brittle to be relied on for continuous capacity
planning and performance debugging in many computer systems. Simply put, a brittle model …
planning and performance debugging in many computer systems. Simply put, a brittle model …