On indexing and compressing finite automata

N Cotumaccio, N Prezza - Proceedings of the 2021 ACM-SIAM Symposium on …, 2021 - SIAM
An index for a finite automaton is a powerful data structure that supports locating paths
labeled with a query pattern, thus solving pattern matching on the underlying regular …

Graphs can be succinctly indexed for pattern matching in time

N Cotumaccio - 2022 Data Compression Conference (DCC), 2022 - ieeexplore.ieee.org
For the first time we provide a succinct pattern matching index for arbitrary graphs that can
be built in polynomial time, while improving both space and query time bounds from SODA …

[KNJIGA][B] Genome-scale algorithm design: bioinformatics in the era of high-throughput sequencing

V Mäkinen, D Belazzougui, F Cunial, AI Tomescu - 2023 - books.google.com
Presenting the fundamental algorithms and data structures that power bioinformatics
workflows, this book covers a range of topics from the foundations of sequence analysis …

On the hardness of sequence alignment on De Bruijn graphs

D Gibney, SV Thankachan, S Aluru - Journal of Computational …, 2022 - liebertpub.com
The problem of aligning a sequence to a walk in a labeled graph is of fundamental
importance to Computational Biology. For an arbitrary graph G=(V, E) and a pattern P of …

Co-lexicographically ordering automata and regular languages-Part I

N Cotumaccio, G D'Agostino, A Policriti, N Prezza - Journal of the ACM, 2023 - dl.acm.org
The states of a finite-state automaton 𝒩 can be identified with collections of words in the
prefix closure of the regular language accepted by 𝒩. But words can be ordered, and among …

Computing matching statistics on Wheeler DFAs

A Conte, N Cotumaccio, T Gagie… - 2023 Data …, 2023 - ieeexplore.ieee.org
Matching statistics were introduced to solve the approximate string matching problem, which
is a recurrent subroutine in bioinformatics applications. In 2010, Ohlebusch et al.[SPIRE …

Sorting finite automata via partition refinement

R Becker, M Cáceres, D Cenzato, SH Kim… - arxiv preprint arxiv …, 2023 - arxiv.org
Wheeler nondeterministic finite automata (WNFAs) were introduced as a generalization of
prefix sorting from strings to labeled graphs. WNFAs admit optimal solutions to classic hard …

On the complexity of recognizing wheeler graphs

D Gibney, SV Thankachan - Algorithmica, 2022 - Springer
In recent years, several compressed indexes based on variants of the Burrows–Wheeler
transform have been introduced. Some of these are used to index structures far more …

Parameterized algorithms for string matching to DAGs: Funnels and beyond

M Caceres - arxiv preprint arxiv:2212.07870, 2022 - arxiv.org
The problem of String Matching to Labeled Graphs (SMLG) asks to find all the paths in a
labeled graph $ G=(V, E) $ whose spellings match that of an input string $ S\in\Sigma^ m …

[HTML][HTML] A new class of string transformations for compressed text indexing

R Giancarlo, G Manzini, A Restivo, G Rosone… - Information and …, 2023 - Elsevier
Introduced about thirty years ago in the field of data compression, the Burrows-Wheeler
Transform (BWT) is a string transformation that, besides being a booster of the performance …