Constructing LZ78 tries and position heaps in linear time for large alphabets

Y Nakashima, I Tomohiro, S Inenaga, H Bannai… - Information Processing …, 2015 - Elsevier
We present the first worst-case linear-time algorithm to compute the Lempel–Ziv 78
factorization of a given string over an integer alphabet. Our algorithm is based on nearest …

Position heaps for parameterized strings

T Katsura, Y Otomo, K Narisawa, A Shinohara - arxiv preprint arxiv …, 2017 - arxiv.org
We propose a new indexing structure for parameterized strings, called parameterized
position heap. Parameterized position heap is applicable for parameterized pattern …

Right-to-left online construction of parameterized position heaps

N Fujisato, Y Nakashima, S Inenaga, H Bannai… - arxiv preprint arxiv …, 2018 - arxiv.org
Two strings of equal length are said to parameterized match if there is a bijection that maps
the characters of one string to those of the other string, so that two strings become identical …

[HTML][HTML] On-line construction of position heaps

G Kucherov - Journal of Discrete Algorithms, 2013 - Elsevier
We propose a simple linear-time on-line algorithm for constructing a position heap for a
string (Ehrenfeucht et al., 2011 [8]). Our definition of position heap differs slightly from the …

The parameterized position heap of a trie

N Fujisato, Y Nakashima, S Inenaga, H Bannai… - … on Algorithms and …, 2019 - Springer
Let\varSigma and\varPi be disjoint alphabets of respective size σ and π. Two strings
over\varSigma ∪\varPi of equal length are said to parameterized match (p-match) if there is …

[HTML][HTML] Linear-size suffix tries

M Crochemore, C Epifanio, R Grossi… - Theoretical Computer …, 2016 - Elsevier
Suffix trees are highly regarded data structures for text indexing and string algorithms
[MCreight 76, Weiner 73]. For any given string w of length n=| w|, a suffix tree for w takes O …

Suffix trees, DAWGs and CDAWGs for forward and backward tries

S Inenaga - Latin American Symposium on Theoretical Informatics, 2020 - Springer
The suffix tree, DAWG, and CDAWG are fundamental indexing structures of a string, with a
number of applications in bioinformatics, information retrieval, data mining, etc. An edge …

New algorithms for position heaps

T Gagie, WK Hon, TH Ku - Annual Symposium on Combinatorial Pattern …, 2013 - Springer
We present several results about position heaps, a relatively new alternative to suffix trees
and suffix arrays. First, we show that if we limit the maximum length of patterns to be sought …

Towards a complete perspective on labeled tree indexing: new size bounds, efficient constructions, and beyond

S Inenaga - Journal of Information Processing, 2021 - jstage.jst.go.jp
A labeled tree (or a trie) is a natural generalization of a string which can also be seen as a
compact representation of a set of strings. This paper considers the labeled tree indexing …

String searching

A Ehrenfeucht, RM McConnell - Handbook of Data Structures and …, 2018 - taylorfrancis.com
Searching for occurrences of a substring in a text is a common operation familiar to anyone
who uses a text editor, word processor, or web browser. It is also the case that algorithms for …