A survey of secure data deduplication schemes for cloud storage systems
Data deduplication has attracted many cloud service providers (CSPs) as a way to reduce
storage costs. Even though the general deduplication approach has been increasingly …
storage costs. Even though the general deduplication approach has been increasingly …
A comprehensive study of the past, present, and future of data deduplication
Data deduplication, an efficient approach to data reduction, has gained increasing attention
and popularity in large-scale storage systems due to the explosive growth of digital data. It …
and popularity in large-scale storage systems due to the explosive growth of digital data. It …
{FastCDC}: A fast and efficient {Content-Defined} chunking approach for data deduplication
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems in the past 15 years or so due to its high redundancy detection abil-ity. However …
systems in the past 15 years or so due to its high redundancy detection abil-ity. However …
The design of fast content-defined chunking for data deduplication based storage systems
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems recently due to its high redundancy detection ability. However, existing CDC-based …
systems recently due to its high redundancy detection ability. However, existing CDC-based …
The design of fast and lightweight resemblance detection for efficient post-deduplication delta compression
Post-deduplication delta compression is a data reduction technique that calculates and
stores the differences of very similar but non-duplicate chunks in storage systems, which is …
stores the differences of very similar but non-duplicate chunks in storage systems, which is …
{DupHunter}: Flexible {High-Performance} Deduplication for Docker Registries
N Zhao, H Albahar, S Abraham, K Chen… - 2020 USENIX Annual …, 2020 - usenix.org
Containers are increasingly used in a broad spectrum of applications from cloud services to
storage to supporting emerging edge computing paradigm. This has led to an explosive …
storage to supporting emerging edge computing paradigm. This has led to an explosive …
A fast asymmetric extremum content defined chunking algorithm for data deduplication in backup storage systems
Chunk-level deduplication plays an important role in backup storage systems. Existing
Content-Defined Chunking (CDC) algorithms, while robust in finding suitable chunk …
Content-Defined Chunking (CDC) algorithms, while robust in finding suitable chunk …
The dilemma between deduplication and locality: Can both be achieved?
Data deduplication is widely used to reduce the size of backup workloads, but it has the
known disadvantage of causing poor data locality, also referred to as the fragmentation …
known disadvantage of causing poor data locality, also referred to as the fragmentation …
Finesse:{Fine-Grained} Feature Locality based Fast Resemblance Detection for {Post-Deduplication} Delta Compression
In storage systems, delta compression is often used as a complementary data reduction
technique for data deduplication because it is able to eliminate redundancy among the non …
technique for data deduplication because it is able to eliminate redundancy among the non …
The dynamic cuckoo filter
The emergence of large-scale dynamic sets in real applications creates stringent
requirements for approximate set representation structures: 1) the capacity of the set …
requirements for approximate set representation structures: 1) the capacity of the set …