External memory algorithms and data structures: Dealing with massive data
JS Vitter - ACM Computing surveys (CsUR), 2001 - dl.acm.org
Data sets in large applications are often too massive to fit completely inside the computers
internal memory. The resulting input/output communication (or I/O) between fast internal …
internal memory. The resulting input/output communication (or I/O) between fast internal …
Firmware over-the-air programming techniques for IoT networks-A survey
K Arakadakis, P Charalampidis… - ACM Computing …, 2021 - dl.acm.org
The devices forming Internet of Things (IoT) networks need to be re-programmed over the
air, so that new features are added, software bugs or security vulnerabilities are resolved …
air, so that new features are added, software bugs or security vulnerabilities are resolved …
Linear work suffix array construction
J Kärkkäinen, P Sanders, S Burkhardt - Journal of the ACM (JACM), 2006 - dl.acm.org
Suffix trees and suffix arrays are widely used and largely interchangeable index structures
on strings and sequences. Practitioners prefer suffix arrays due to their simplicity and space …
on strings and sequences. Practitioners prefer suffix arrays due to their simplicity and space …
Efficient construction of an assembly string graph using the FM-index
JT Simpson, R Durbin - Bioinformatics, 2010 - academic.oup.com
Motivation: Sequence assembly is a difficult problem whose importance has grown again
recently as the cost of sequencing has dramatically dropped. Most new sequence assembly …
recently as the cost of sequencing has dramatically dropped. Most new sequence assembly …
Algorithms and data structures for external memory
JS Vitter - … and Trends® in Theoretical Computer Science, 2008 - nowpublishers.com
Data sets in large applications are often too massive to fit completely inside the computer's
internal memory. The resulting input/output communication (or I/O) between fast internal …
internal memory. The resulting input/output communication (or I/O) between fast internal …
STXXL: standard template library for XXL data sets
R Dementiev, L Kettner… - Software: Practice and …, 2008 - Wiley Online Library
We present the software library Stxxl that is an implementation of the C++ standard template
library (STL) for processing huge data sets that can fit only on hard disks. It supports parallel …
library (STL) for processing huge data sets that can fit only on hard disks. It supports parallel …
Permuted longest-common-prefix array
The longest-common-prefix (LCP) array is an adjunct to the suffix array that allows many
string processing problems to be solved in optimal time and space. Its construction is a …
string processing problems to be solved in optimal time and space. Its construction is a …
Fast and accurate read map** with approximate seeds and multiple backtracking
We present Masai, a read mapper representing the state-of-the-art in terms of speed and
accuracy. Our tool is an order of magnitude faster than RazerS 3 and mrFAST, 2–4 times …
accuracy. Our tool is an order of magnitude faster than RazerS 3 and mrFAST, 2–4 times …
Practical linear-time O(1)-workspace suffix sorting for constant alphabets
G Nong - ACM Transactions on Information Systems (TOIS), 2013 - dl.acm.org
This article presents an O (n)-time algorithm called SACA-K for sorting the suffixes of an
input string T [0, n-1] over an alphabet A [0, K-1]. The problem of sorting the suffixes of T is …
input string T [0, n-1] over an alphabet A [0, K-1]. The problem of sorting the suffixes of T is …
Lightweight data indexing and compression in external memory
In this paper we describe algorithms for computing the Burrows-Wheeler Transform (bwt)
and for building (compressed) indexes in external memory. The innovative feature of our …
and for building (compressed) indexes in external memory. The innovative feature of our …