When deduplication meets migration: An efficient and adaptive strategy in distributed storage systems

G Cheng, L Luo, J Xia, D Guo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The traditional migration methods are confronted with formidable challenges when data
deduplication technologies are incorporated. First, the deduplication creates data-sharing …

The doctrine of mean: Realizing deduplication storage at unreliable edge

J Xia, G Cheng, L Luo, D Guo, P Lv… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Placing popular data at the network edge helps reduce the retrieval latency, but it also
brings challenges to the limited edge storage space. Currently, using available yet not …

Ripple: Enabling decentralized data deduplication at the edge

R Luo, Q He, F Chen, S Wu, H Jin… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
With its advantages in ensuring low data retrieval latency and reducing backhaul network
traffic, edge computing is becoming a backbone solution for many latency-sensitive …

InftyDedup: Scalable and Cost-Effective Cloud Tiering with Deduplication

I Kotlarska, A Jackowski, K Lichota, M Welnicki… - … USENIX Conference on …, 2023 - usenix.org
Cloud tiering is the process of moving selected data from on-premise storage to the cloud,
which has recently become important for backup solutions. As subsequent backups usually …

Physical vs. Logical Indexing with IDEA: Inverted Deduplication-Aware Index

A Levi, P Shilane, S Sheinvald, G Yadgar - 22nd USENIX Conference on …, 2024 - usenix.org
In the realm of information retrieval, the need to maintain reliable term-indexing has grown
more acute in recent years, with vast amounts of ever-growing online data searched by a …

Dataset Similarity Detection for Global Deduplication in the DD File System

T Wong, S Thakkar, KF Hsieh, Z Tom… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
Deduplication has become a widely used technique to reduce space requirements for
storage systems by replacing redundant chunks of data with references. While storage …

Speed-Dedup: A New Deduplication Framework for Enhanced Performance and Reduced Overhead in Scale-Out Storage

P Hamandawana, DJ Cho, TS Chung - Electronics, 2024 - mdpi.com
Conventional deduplication systems face critical challenges such as excessive write
amplification, high read/write latency, and sub-optimal storage utilization. These limitations …

IBNR-RD: Intra-Block Neighborhood Relationship-Based Resemblance Detection for High-Performance Multi-Node Post-Deduplication

D Zeng, W Tian, T He, R Li, X Ye… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Post-deduplication in traditional cloud environments primarily focuses on single-node,
where delta compression is performed on the same deduplication node located on server …

CPI: A Collaborative Partial Indexing Design for Large-Scale Deduplication Systems

Z Cao, DHC Du - IEEE Transactions on Computers, 2024 - ieeexplore.ieee.org
Data deduplication relies on a chunk index to identify the redundancy of incoming chunks.
As backup data scales, it is impractical to maintain the entire chunk index in memory …

UltraCDC: A Fast and Stable Content-Defined Chunking Algorithm for Deduplication-based Backup Storage Systems

P Zhou, Z Wang, W Xia, H Zhang - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Content-Defined Chunking (CDC) is the key stage of data deduplication since it has a
significant impact on deduplication system's throughput and deduplication efficiency …