Spaces, trees, and colors: The algorithmic landscape of document retrieval on sequences

G Navarro - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Document retrieval is one of the best-established information retrieval activities since
the'60s, pervading all search engines. Its aim is to obtain, from a collection of text …

[CARTE][B] Compact data structures: A practical approach

G Navarro - 2016 - books.google.com
Compact data structures help represent data in reduced space while allowing it to be
queried, navigated, and operated in compressed form. They are essential tools for efficiently …

[HTML][HTML] Wavelet trees for all

G Navarro - Journal of Discrete Algorithms, 2014 - Elsevier
The wavelet tree is a versatile data structure that serves a number of purposes, from string
processing to computational geometry. It can be regarded as a device that represents a …

The wavelet matrix: An efficient wavelet tree for large alphabets

F Claude, G Navarro, A Ordónez - Information Systems, 2015 - Elsevier
The wavelet tree is a flexible data structure that permits representing sequences S [1, n] of
symbols over an alphabet of size σ, within compressed space and supporting a wide range …

Top-k Document Retrieval in Optimal Time and Linear Space

G Navarro, Y Nekrich - Proceedings of the twenty-third annual ACM-SIAM …, 2012 - SIAM
We describe a data structure that uses O (n)-word space and reports k most relevant
documents that contain a query pattern P in optimal O (| P|+ k) time. Our construction …

Columnar storage and list-based processing for graph database management systems

P Gupta, A Mhedhbi, S Salihoglu - arxiv preprint arxiv:2103.02284, 2021 - arxiv.org
We revisit column-oriented storage and query processing techniques in the context of
contemporary graph database management systems (GDBMSs). Similar to column-oriented …

[HTML][HTML] Fast construction of wavelet trees

JI Munro, Y Nekrich, JS Vitter - Theoretical Computer Science, 2016 - Elsevier
Fast construction of wavelet trees - ScienceDirect Skip to main contentSkip to article
Elsevier logo Journals & Books Search RegisterSign in View PDF Download full issue …

[HTML][HTML] Improved range minimum queries

H Ferrada, G Navarro - Journal of Discrete Algorithms, 2017 - Elsevier
Fischer and Heun [SICOMP 2011] proposed the first Range Minimum Query (RMQ) data
structure on an array A [1, n] that uses 2 n+ o (n) bits and answers queries in O (1) time …

Time-Optimal Top- Document Retrieval

G Navarro, Y Nekrich - SIAM Journal on Computing, 2017 - SIAM
Let \mathcalD be a collection of D documents, which are strings over an alphabet of size σ,
of total length n. We describe a data structure that uses linear space and reports k most …

[HTML][HTML] Compact binary relation representations with rich functionality

J Barbay, F Claude, G Navarro - Information and Computation, 2013 - Elsevier
Binary relations are an important abstraction arising in many data representation problems.
The data structures proposed so far to represent them support just a few basic operations …