Spaces, trees, and colors: The algorithmic landscape of document retrieval on sequences

G Navarro - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Document retrieval is one of the best-established information retrieval activities since
the'60s, pervading all search engines. Its aim is to obtain, from a collection of text …

Higher lower bounds from the 3SUM conjecture

T Kopelowitz, S Pettie, E Porat - Proceedings of the twenty-seventh annual …, 2016 - SIAM
The 3SUM conjecture has proven to be a valuable tool for proving conditional lower bounds
on dynamic data structures and graph problems. This line of work was initiated by Pâtraşcu …

New algorithms on wavelet trees and applications to information retrieval

T Gagie, G Navarro, SJ Puglisi - Theoretical Computer Science, 2012 - Elsevier
Wavelet trees are widely used in the representation of sequences, permutations, text
collections, binary relations, discrete points, and other succinct data structures. We show …

Top-k Document Retrieval in Optimal Time and Linear Space

G Navarro, Y Nekrich - Proceedings of the twenty-third annual ACM-SIAM …, 2012 - SIAM
We describe a data structure that uses O (n)-word space and reports k most relevant
documents that contain a query pattern P in optimal O (| P|+ k) time. Our construction …

Conditional lower bounds for space/time tradeoffs

I Goldstein, T Kopelowitz, M Lewenstein… - Algorithms and Data …, 2017 - Springer
In recent years much effort has been concentrated towards achieving polynomial time lower
bounds on algorithms for solving various well-known problems. A useful technique for …

Inverted indexes for phrases and strings

M Patil, SV Thankachan, R Shah, WK Hon… - Proceedings of the 34th …, 2011 - dl.acm.org
Inverted indexes are the most fundamental and widely used data structures in information
retrieval. For each unique word occurring in a document collection, the inverted index stores …

Wavelet trees: A survey

C Makris - Computer Science and Information Systems, 2012 - doiserbia.nb.rs
The topic of this paper is the exploration of the various characteristics of the wavelet tree
data structure, a data structure that was initially proposed for text compression applications …

Space-Efficient Frameworks for Top-k String Retrieval

WK Hon, R Shah, SV Thankachan… - Journal of the ACM (JACM), 2014 - dl.acm.org
The inverted index is the backbone of modern web search engines. For each word in a
collection of web documents, the index records the list of documents where this word occurs …

[HTML][HTML] On non-progressive spread of influence through social networks

MA Fazli, M Ghodsi, J Habibi, P Jalaly… - Theoretical Computer …, 2014 - Elsevier
The spread of influence in social networks is studied in two main categories: progressive
models and non-progressive models (see, eg, the seminal work of Kempe et al.[8]). While …

[HTML][HTML] On hardness of several string indexing problems

KG Larsen, JI Munro, JS Nielsen… - Theoretical Computer …, 2015 - Elsevier
Abstract Let D={d 1, d 2,…, d D} be a collection of D string documents of n characters in total.
The two-pattern matching problems ask to index D for answering the following queries …