Dynamic and scalable DNA-based information storage
The physical architectures of information storage systems often dictate how information is
encoded, databases are organized, and files are accessed. Here we show that a simple …
encoded, databases are organized, and files are accessed. Here we show that a simple …
Hands: A heuristically arranged non-backup in-line deduplication system
Deduplicating in-line data on primary storage is hampered by the disk bottleneck problem,
an issue which results from the need to keep an index map** portions of data to hash …
an issue which results from the need to keep an index map** portions of data to hash …
Parity Logging with Reserved Space: Towards {Efficient} Updates and Recovery in Erasure-coded Clustered Storage
Many modern storage systems adopt erasure coding to provide data availability guarantees
with low redundancy. Log-based storage is often used to append new data rather than …
with low redundancy. Log-based storage is often used to append new data rather than …
[PDF][PDF] Emulation & virtualization as preservation strategies
DSH Rosenthal - Andrew W. Mellon Foundation, 2015 - stanford.edu
Between the two fundamental digital preservation strategies, migration has been strongly
favored. Recent developments in emulation frameworks make it possible to deliver …
favored. Recent developments in emulation frameworks make it possible to deliver …
Ta-update: An adaptive update scheme with tree-structured transmission in erasure-coded storage systems
Erasure coding has received considerable attentions due to the better tradeoff between the
space efficiency and reliability. The frequent update of the stored data in the distributed …
space efficiency and reliability. The frequent update of the stored data in the distributed …
Analysis of the {ECMWF} Storage Landscape
Despite domain-specific digital archives are growing in number and size, there is a lack of
studies describing their architectures and runtime characteristics. This paper investigates the …
studies describing their architectures and runtime characteristics. This paper investigates the …
Access patterns for robots and humans in web archives
Although user access patterns on the live web are well-understood, there has been no
corresponding study of how users, both humans and robots, access web archives. Based on …
corresponding study of how users, both humans and robots, access web archives. Based on …
DNA Bloom Filter enables anti-contamination and file version control for DNA-based data storage
DNA storage is one of the most promising ways for future information storage due to its high
data storage density, durable storage time and low maintenance cost. However, errors are …
data storage density, durable storage time and low maintenance cost. However, errors are …
T-update: A tree-structured update scheme with top-down transmission in erasure-coded systems
Erasure coding has received considerable attention due to the better tradeoff between the
space efficiency and reliability. However, it consumes large network traffic and long time to …
space efficiency and reliability. However, it consumes large network traffic and long time to …
Tools for analyzing parallel I/O
Parallel application I/O performance often does not meet user expectations. Additionally,
slight access pattern modifications may lead to significant changes in performance due to …
slight access pattern modifications may lead to significant changes in performance due to …