[HTML][HTML] Fast relative Lempel–Ziv self-index for similar sequences

HH Do, J Jansson, K Sadakane, WK Sung - Theoretical Computer Science, 2014 - Elsevier
Recent advances in biotechnology and web technology are continuously generating huge
collections of similar strings. People now face the problem of storing them compactly while …

Stronger Lempel-Ziv based compressed text indexing

D Arroyuelo, G Navarro, K Sadakane - Algorithmica, 2012 - Springer
Abstract Given a text T [1.. u] over an alphabet of size σ, the full-text search problem consists
in finding the occ occurrences of a given pattern P [1.. m] in T. In indexed text searching we …

Fully-compressed suffix trees

LMS Russo, G Navarro, AL Oliveira - … Symposium, Búzios, Brazil, April 7-11 …, 2008 - Springer
Suffix trees are by far the most important data structure in stringology, with myriads of
applications in fields like bioinformatics and information retrieval. Classical representations …

Implementing the LZ-index: Theory versus practice

G Navarro - Journal of Experimental Algorithmics (JEA), 2009 - dl.acm.org
The LZ-index is a theoretical proposal of a lightweight data structure for text indexing, based
on the Ziv-Lempel trie. If a text of u characters over an alphabet of size σ is compressible to n …

A compressed self-index using a Ziv–Lempel dictionary

LMS Russo, AL Oliveira - Information Retrieval, 2008 - Springer
A compressed full-text self-index for a text T, of size u, is a data structure used to search for
patterns P, of size m, in T, that requires reduced space, ie space that depends on the …

Compressing DNA sequence databases with coil

WTJ White, MD Hendy - BMC bioinformatics, 2008 - Springer
Background Publicly available DNA sequence databases such as GenBank are large, and
are growing at an exponential rate. The sheer volume of data being dealt with presents …

Dynamic fully-compressed suffix trees

LMS Russo, G Navarro, AL Oliveira - … , CPM 2008, Pisa, Italy, June 18-20 …, 2008 - Springer
Suffix trees are by far the most important data structure in stringology, with myriads of
applications in fields like bioinformatics, data compression and information retrieval …

Approximate string matching with Lempel-Ziv compressed indexes

LMS Russo, G Navarro, AL Oliveira - … 2007 Santiago, Chile, October 29-31 …, 2007 - Springer
A compressed full-text self-index for a text T is a data structure requiring reduced space and
able of searching for patterns P in T. Furthermore, the structure can reproduce any substring …

Smaller and faster Lempel-Ziv indices

D Arroyuelo, G Navarro - 2007 - repositorio.uchile.cl
Given a text T [1.. u] over an alphabet of size σ= O (polylog (u)) and with k-th order empirical
entropy Hk (T), we propose a new compressed full-text self-index based on the Lempel-Ziv …

Indexed hierarchical approximate string matching

LMS Russo, G Navarro, AL Oliveira - International Symposium on String …, 2008 - Springer
We present a new search procedure for approximate string matching over suffix trees. We
show that hierarchical verification, which is a well-established technique for on-line …