On indexing and compressing finite automata
An index for a finite automaton is a powerful data structure that supports locating paths
labeled with a query pattern, thus solving pattern matching on the underlying regular …
labeled with a query pattern, thus solving pattern matching on the underlying regular …
Graphs can be succinctly indexed for pattern matching in time
N Cotumaccio - 2022 Data Compression Conference (DCC), 2022 - ieeexplore.ieee.org
For the first time we provide a succinct pattern matching index for arbitrary graphs that can
be built in polynomial time, while improving both space and query time bounds from SODA …
be built in polynomial time, while improving both space and query time bounds from SODA …
[KNJIGA][B] Genome-scale algorithm design: bioinformatics in the era of high-throughput sequencing
Presenting the fundamental algorithms and data structures that power bioinformatics
workflows, this book covers a range of topics from the foundations of sequence analysis …
workflows, this book covers a range of topics from the foundations of sequence analysis …
On the hardness of sequence alignment on De Bruijn graphs
The problem of aligning a sequence to a walk in a labeled graph is of fundamental
importance to Computational Biology. For an arbitrary graph G=(V, E) and a pattern P of …
importance to Computational Biology. For an arbitrary graph G=(V, E) and a pattern P of …
Co-lexicographically ordering automata and regular languages-Part I
The states of a finite-state automaton 𝒩 can be identified with collections of words in the
prefix closure of the regular language accepted by 𝒩. But words can be ordered, and among …
prefix closure of the regular language accepted by 𝒩. But words can be ordered, and among …
Computing matching statistics on Wheeler DFAs
Matching statistics were introduced to solve the approximate string matching problem, which
is a recurrent subroutine in bioinformatics applications. In 2010, Ohlebusch et al.[SPIRE …
is a recurrent subroutine in bioinformatics applications. In 2010, Ohlebusch et al.[SPIRE …
Sorting finite automata via partition refinement
Wheeler nondeterministic finite automata (WNFAs) were introduced as a generalization of
prefix sorting from strings to labeled graphs. WNFAs admit optimal solutions to classic hard …
prefix sorting from strings to labeled graphs. WNFAs admit optimal solutions to classic hard …
On the complexity of recognizing wheeler graphs
In recent years, several compressed indexes based on variants of the Burrows–Wheeler
transform have been introduced. Some of these are used to index structures far more …
transform have been introduced. Some of these are used to index structures far more …
Parameterized algorithms for string matching to DAGs: Funnels and beyond
M Caceres - arxiv preprint arxiv:2212.07870, 2022 - arxiv.org
The problem of String Matching to Labeled Graphs (SMLG) asks to find all the paths in a
labeled graph $ G=(V, E) $ whose spellings match that of an input string $ S\in\Sigma^ m …
labeled graph $ G=(V, E) $ whose spellings match that of an input string $ S\in\Sigma^ m …
[HTML][HTML] A new class of string transformations for compressed text indexing
Introduced about thirty years ago in the field of data compression, the Burrows-Wheeler
Transform (BWT) is a string transformation that, besides being a booster of the performance …
Transform (BWT) is a string transformation that, besides being a booster of the performance …