Resource management in clouds: Survey and research challenges
Resource management in a cloud environment is a hard problem, due to: the scale of
modern data centers; the heterogeneity of resource types and their interdependencies; the …
modern data centers; the heterogeneity of resource types and their interdependencies; the …
A survey and classification of software-defined storage systems
The exponential growth of digital information is imposing increasing scale and efficiency
demands on modern storage infrastructures. As infrastructure complexity increases, so does …
demands on modern storage infrastructures. As infrastructure complexity increases, so does …
Reflex: Remote flash≈ local flash
Remote access to NVMe Flash enables flexible scaling and high utilization of Flash capacity
and IOPS within a datacenter. However, existing systems for remote Flash access either …
and IOPS within a datacenter. However, existing systems for remote Flash access either …
Ioflow: A software-defined storage architecture
In data centers, the IO path to storage is long and complex. It comprises many layers or"
stages" with opaque interfaces between them. This makes it hard to enforce end-to-end …
stages" with opaque interfaces between them. This makes it hard to enforce end-to-end …
Flash storage disaggregation
PCIe-based Flash is commonly deployed to provide datacenter applications with high IO
rates. However, its capacity and bandwidth are often underutilized as it is difficult to design …
rates. However, its capacity and bandwidth are often underutilized as it is difficult to design …
What bugs live in the cloud? a study of 3000+ issues in cloud systems
We conduct a comprehensive study of development and deployment issues of six popular
and important cloud systems (Hadoop MapReduce, HDFS, HBase, Cassandra, ZooKeeper …
and important cloud systems (Hadoop MapReduce, HDFS, HBase, Cassandra, ZooKeeper …
Retro: Targeted resource management in multi-tenant distributed systems
In distributed systems shared by multiple tenants, effective resource management is an
important pre-requisite to providing quality of service guarantees. Many systems deployed …
important pre-requisite to providing quality of service guarantees. Many systems deployed …
Aequitas: Admission control for performance-critical rpcs in datacenters
With the increasing popularity of disaggregated storage and microservice architectures, high
fan-out and fan-in Remote Procedure Calls (RPCs) now generate most of the traffic in …
fan-out and fan-in Remote Procedure Calls (RPCs) now generate most of the traffic in …
Prioritymeister: Tail latency qos for shared networked storage
Meeting service level objectives (SLOs) for tail latency is an important and challenging open
problem in cloud computing infrastructures. The challenges are exacerbated by burstiness …
problem in cloud computing infrastructures. The challenges are exacerbated by burstiness …
Nova-LSM: a distributed, component-based LSM-tree key-value store
The cloud infrastructure motivates disaggregation of monolithic data stores into components
that are assembled together based on an application's workload. This study investigates …
that are assembled together based on an application's workload. This study investigates …