Fast relative Lempel–Ziv self-index for similar sequences
Recent advances in biotechnology and web technology are continuously generating huge
collections of similar strings. People now face the problem of storing them compactly while …
Stronger Lempel-Ziv based compressed text indexing
Given a text T[1..u] over an alphabet of size σ, the full-text search problem consists
in finding the occ occurrences of a given pattern P[1..m] in T. In indexed text searching we …
Fully-compressed suffix trees
Suffix trees are by far the most important data structure in stringology, with myriads of
applications in fields like bioinformatics and information retrieval. Classical representations …
Implementing the LZ-index: Theory versus practice
G Navarro - Journal of Experimental Algorithmics (JEA), 2009 - dl.acm.org
The LZ-index is a theoretical proposal of a lightweight data structure for text indexing, based
on the Ziv-Lempel trie. If a text of u characters over an alphabet of size σ is compressible to n …
A compressed self-index using a Ziv–Lempel dictionary
A compressed full-text self-index for a text T, of size u, is a data structure used to search for
patterns P, of size m, in T, that requires reduced space, i.e., space that depends on the …
Compressing DNA sequence databases with coil
WTJ White, MD Hendy - BMC bioinformatics, 2008 - Springer
Background Publicly available DNA sequence databases such as GenBank are large, and
are growing at an exponential rate. The sheer volume of data being dealt with presents …
Dynamic fully-compressed suffix trees
Suffix trees are by far the most important data structure in stringology, with myriads of
applications in fields like bioinformatics, data compression and information retrieval …
Approximate string matching with Lempel-Ziv compressed indexes
A compressed full-text self-index for a text T is a data structure requiring reduced space and
able to search for patterns P in T. Furthermore, the structure can reproduce any substring …
Smaller and faster Lempel-Ziv indices
D Arroyuelo, G Navarro - 2007 - repositorio.uchile.cl
Given a text T[1..u] over an alphabet of size σ = O(polylog(u)) and with k-th order empirical
entropy H_k(T), we propose a new compressed full-text self-index based on the Lempel-Ziv …
Indexed hierarchical approximate string matching
We present a new search procedure for approximate string matching over suffix trees. We
show that hierarchical verification, which is a well-established technique for on-line …