Constructing LZ78 tries and position heaps in linear time for large alphabets
We present the first worst-case linear-time algorithm to compute the Lempel–Ziv 78
factorization of a given string over an integer alphabet. Our algorithm is based on nearest …
factorization of a given string over an integer alphabet. Our algorithm is based on nearest …
Position heaps for parameterized strings
T Katsura, Y Otomo, K Narisawa, A Shinohara - arxiv preprint arxiv …, 2017 - arxiv.org
We propose a new indexing structure for parameterized strings, called parameterized
position heap. Parameterized position heap is applicable for parameterized pattern …
position heap. Parameterized position heap is applicable for parameterized pattern …
Right-to-left online construction of parameterized position heaps
Two strings of equal length are said to parameterized match if there is a bijection that maps
the characters of one string to those of the other string, so that two strings become identical …
the characters of one string to those of the other string, so that two strings become identical …
[HTML][HTML] On-line construction of position heaps
G Kucherov - Journal of Discrete Algorithms, 2013 - Elsevier
We propose a simple linear-time on-line algorithm for constructing a position heap for a
string (Ehrenfeucht et al., 2011 [8]). Our definition of position heap differs slightly from the …
string (Ehrenfeucht et al., 2011 [8]). Our definition of position heap differs slightly from the …
The parameterized position heap of a trie
Let\varSigma and\varPi be disjoint alphabets of respective size σ and π. Two strings
over\varSigma ∪\varPi of equal length are said to parameterized match (p-match) if there is …
over\varSigma ∪\varPi of equal length are said to parameterized match (p-match) if there is …
[HTML][HTML] Linear-size suffix tries
Suffix trees are highly regarded data structures for text indexing and string algorithms
[MCreight 76, Weiner 73]. For any given string w of length n=| w|, a suffix tree for w takes O …
[MCreight 76, Weiner 73]. For any given string w of length n=| w|, a suffix tree for w takes O …
Suffix trees, DAWGs and CDAWGs for forward and backward tries
S Inenaga - Latin American Symposium on Theoretical Informatics, 2020 - Springer
The suffix tree, DAWG, and CDAWG are fundamental indexing structures of a string, with a
number of applications in bioinformatics, information retrieval, data mining, etc. An edge …
number of applications in bioinformatics, information retrieval, data mining, etc. An edge …
New algorithms for position heaps
We present several results about position heaps, a relatively new alternative to suffix trees
and suffix arrays. First, we show that if we limit the maximum length of patterns to be sought …
and suffix arrays. First, we show that if we limit the maximum length of patterns to be sought …
Towards a complete perspective on labeled tree indexing: new size bounds, efficient constructions, and beyond
S Inenaga - Journal of Information Processing, 2021 - jstage.jst.go.jp
A labeled tree (or a trie) is a natural generalization of a string which can also be seen as a
compact representation of a set of strings. This paper considers the labeled tree indexing …
compact representation of a set of strings. This paper considers the labeled tree indexing …
String searching
A Ehrenfeucht, RM McConnell - Handbook of Data Structures and …, 2018 - taylorfrancis.com
Searching for occurrences of a substring in a text is a common operation familiar to anyone
who uses a text editor, word processor, or web browser. It is also the case that algorithms for …
who uses a text editor, word processor, or web browser. It is also the case that algorithms for …