[HTML][HTML] Refining the r-index
Abstract Gagie, Navarro and Prezza's r-index (SODA, 2018) promises to speed up DNA
alignment and variation calling by allowing us to index entire genomic databases, provided …
alignment and variation calling by allowing us to index entire genomic databases, provided …
[HTML][HTML] Graphs cannot be indexed in polynomial time for sub-quadratic time string matching, unless SETH fails
The string matching problem on a node-labeled graph G=(V, E) asks whether a given
pattern string P equals the concatenation of node labels of some path in G. This is a basic …
pattern string P equals the concatenation of node labels of some path in G. This is a basic …
Faster algorithms for longest common substring
P Charalampopoulos, T Kociumaka, SP Pissis… - arxiv preprint arxiv …, 2021 - arxiv.org
In the classic longest common substring (LCS) problem, we are given two strings $ S $ and
$ T $, each of length at most $ n $, over an alphabet of size $\sigma $, and we are asked to …
$ T $, each of length at most $ n $, over an alphabet of size $\sigma $, and we are asked to …
A compact index for cartesian tree matching
Cartesian tree matching is a recently introduced string matching problem in which two
strings match if their corresponding Cartesian trees are the same. It is considered …
strings match if their corresponding Cartesian trees are the same. It is considered …
Approximate similarity search under edit distance using locality-sensitive hashing
S McCauley - arxiv preprint arxiv:1907.01600, 2019 - arxiv.org
Edit distance similarity search, also called approximate pattern matching, is a fundamental
problem with widespread database applications. The goal of the problem is to preprocess …
problem with widespread database applications. The goal of the problem is to preprocess …
Correcting 4sU induced quantification bias in nucleotide conversion RNA-seq data
K Berg, M Lodha, I Delazer, K Bartosik… - Nucleic Acids …, 2024 - academic.oup.com
Abstract Nucleoside analogues like 4-thiouridine (4sU) are used to metabolically label
newly synthesized RNA. Chemical conversion of 4sU before sequencing induces T-to-C …
newly synthesized RNA. Chemical conversion of 4sU before sequencing induces T-to-C …
Pattern masking for dictionary matching: theory and practice
Data masking is a common technique for sanitizing sensitive data maintained in database
systems which is becoming increasingly important in various application areas, such as in …
systems which is becoming increasingly important in various application areas, such as in …
Pattern Masking for Dictionary Matching
In the Pattern Masking for Dictionary Matching (PMDM) problem, we are given a dictionary
$\mathcal {D} $ of $ d $ strings, each of length $\ell $, a query string $ q $ of length $\ell …
$\mathcal {D} $ of $ d $ strings, each of length $\ell $, a query string $ q $ of length $\ell …
A lower bound for dynamic fractional cascading
P Afshani - Proceedings of the 2021 ACM-SIAM Symposium on …, 2021 - SIAM
We investigate the limits of one of the fundamental ideas in data structures: fractional
cascading. This is an important data structure technique to speed up repeated searches for …
cascading. This is an important data structure technique to speed up repeated searches for …