Spaces, trees, and colors: The algorithmic landscape of document retrieval on sequences

G Navarro - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Document retrieval is one of the best-established information retrieval activities since
the'60s, pervading all search engines. Its aim is to obtain, from a collection of text …

Space-efficient preprocessing schemes for range minimum queries on static arrays

J Fischer, V Heun - SIAM Journal on Computing, 2011 - SIAM
Given a static array of n totally ordered objects, the range minimum query problem is to build
a data structure that allows us to answer efficiently subsequent on-line queries of the form …

Fully functional static and dynamic succinct trees

G Navarro, K Sadakane - ACM Transactions on Algorithms (TALG), 2014 - dl.acm.org
We propose new succinct representations of ordinal trees and match various space/time
lower bounds. It is known that any n-node static tree can be represented in 2 n+ o (n) bits so …

Indexing variation graphs

J Sirén - 2017 Proceedings of the ninteenth workshop on …, 2017 - SIAM
Variation graphs, which represent genetic variation within a population, are replacing
sequences as reference genomes. Path indexes are one of the most important tools for …

TADOC: Text analytics directly on compression

F Zhang, J Zhai, X Shen, D Wang, Z Chen, O Mutlu… - The VLDB Journal, 2021 - Springer
This article provides a comprehensive description of text analytics directly on compression
(TADOC), which enables direct document analytics on compressed textual data. The article …

New algorithms on wavelet trees and applications to information retrieval

T Gagie, G Navarro, SJ Puglisi - Theoretical Computer Science, 2012 - Elsevier
Wavelet trees are widely used in the representation of sequences, permutations, text
collections, binary relations, discrete points, and other succinct data structures. We show …

Fully-functional succinct trees

K Sadakane, G Navarro - Proceedings of the twenty-first annual ACM-SIAM …, 2010 - SIAM
We propose new succinct representations of ordinal trees, which have been studied
extensively. It is known that any n-node static tree can be represented in 2 n+ o (n) bits and a …

Space-efficient construction of compressed indexes in deterministic linear time

JI Munro, G Navarro, Y Nekrich - Proceedings of the Twenty-Eighth Annual …, 2017 - SIAM
We show that the compressed suffix array and the compressed suffix tree of a string T can be
built in O (n) deterministic time using O (n log σ) bits of space, where n is the string length …

Faster entropy-bounded compressed suffix trees

J Fischer, V Mäkinen, G Navarro - Theoretical Computer Science, 2009 - Elsevier
Suffix trees are among the most important data structures in stringology, with a number of
applications in flourishing areas like bioinformatics. Their main problem is space usage …

Top-k Document Retrieval in Optimal Time and Linear Space

G Navarro, Y Nekrich - Proceedings of the twenty-third annual ACM-SIAM …, 2012 - SIAM
We describe a data structure that uses O (n)-word space and reports k most relevant
documents that contain a query pattern P in optimal O (| P|+ k) time. Our construction …