[HTML][HTML] Refining the r-index

H Bannai, T Gagie, I Tomohiro - Theoretical Computer Science, 2020 - Elsevier
Abstract Gagie, Navarro and Prezza's r-index (SODA, 2018) promises to speed up DNA
alignment and variation calling by allowing us to index entire genomic databases, provided …

[HTML][HTML] Graphs cannot be indexed in polynomial time for sub-quadratic time string matching, unless SETH fails

M Equi, V Mäkinen, AI Tomescu - Theoretical Computer Science, 2023 - Elsevier
The string matching problem on a node-labeled graph G=(V, E) asks whether a given
pattern string P equals the concatenation of node labels of some path in G. This is a basic …

Faster algorithms for longest common substring

P Charalampopoulos, T Kociumaka, SP Pissis… - arxiv preprint arxiv …, 2021 - arxiv.org
In the classic longest common substring (LCS) problem, we are given two strings $ S $ and
$ T $, each of length at most $ n $, over an alphabet of size $\sigma $, and we are asked to …

A compact index for cartesian tree matching

SH Kim, HG Cho - … on Combinatorial Pattern Matching (CPM 2021 …, 2021 - drops.dagstuhl.de
Cartesian tree matching is a recently introduced string matching problem in which two
strings match if their corresponding Cartesian trees are the same. It is considered …

Approximate similarity search under edit distance using locality-sensitive hashing

S McCauley - arxiv preprint arxiv:1907.01600, 2019 - arxiv.org
Edit distance similarity search, also called approximate pattern matching, is a fundamental
problem with widespread database applications. The goal of the problem is to preprocess …

Correcting 4sU induced quantification bias in nucleotide conversion RNA-seq data

K Berg, M Lodha, I Delazer, K Bartosik… - Nucleic Acids …, 2024 - academic.oup.com
Abstract Nucleoside analogues like 4-thiouridine (4sU) are used to metabolically label
newly synthesized RNA. Chemical conversion of 4sU before sequencing induces T-to-C …

Pattern masking for dictionary matching: theory and practice

P Charalampopoulos, H Chen, P Christen, G Loukides… - Algorithmica, 2024 - Springer
Data masking is a common technique for sanitizing sensitive data maintained in database
systems which is becoming increasingly important in various application areas, such as in …

Pattern Masking for Dictionary Matching

P Charalampopoulos, H Chen, P Christen… - arxiv preprint arxiv …, 2020 - arxiv.org
In the Pattern Masking for Dictionary Matching (PMDM) problem, we are given a dictionary
$\mathcal {D} $ of $ d $ strings, each of length $\ell $, a query string $ q $ of length $\ell …

A lower bound for dynamic fractional cascading

P Afshani - Proceedings of the 2021 ACM-SIAM Symposium on …, 2021 - SIAM
We investigate the limits of one of the fundamental ideas in data structures: fractional
cascading. This is an important data structure technique to speed up repeated searches for …