Spaces, trees, and colors: The algorithmic landscape of document retrieval on sequences
G Navarro - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
Document retrieval is one of the best-established information retrieval activities since
the '60s, pervading all search engines. Its aim is to obtain, from a collection of text …
Higher lower bounds from the 3SUM conjecture
The 3SUM conjecture has proven to be a valuable tool for proving conditional lower bounds
on dynamic data structures and graph problems. This line of work was initiated by Pâtraşcu …
New algorithms on wavelet trees and applications to information retrieval
Wavelet trees are widely used in the representation of sequences, permutations, text
collections, binary relations, discrete points, and other succinct data structures. We show …
Top-k Document Retrieval in Optimal Time and Linear Space
We describe a data structure that uses O(n)-word space and reports k most relevant
documents that contain a query pattern P in optimal O(|P| + k) time. Our construction …
Conditional lower bounds for space/time tradeoffs
In recent years much effort has been concentrated towards achieving polynomial time lower
bounds on algorithms for solving various well-known problems. A useful technique for …
Inverted indexes for phrases and strings
Inverted indexes are the most fundamental and widely used data structures in information
retrieval. For each unique word occurring in a document collection, the inverted index stores …
Wavelet trees: A survey
C Makris - Computer Science and Information Systems, 2012 - doiserbia.nb.rs
The topic of this paper is the exploration of the various characteristics of the wavelet tree
data structure, a data structure that was initially proposed for text compression applications …
Space-Efficient Frameworks for Top-k String Retrieval
The inverted index is the backbone of modern web search engines. For each word in a
collection of web documents, the index records the list of documents where this word occurs …
[HTML] On non-progressive spread of influence through social networks
The spread of influence in social networks is studied in two main categories: progressive
models and non-progressive models (see, e.g., the seminal work of Kempe et al. [8]). While …
[HTML] On hardness of several string indexing problems
Let D = {d_1, d_2, …, d_D} be a collection of D string documents of n characters in total.
The two-pattern matching problems ask to index D for answering the following queries …