Overlap graphs and de Bruijn graphs: data structures for de novo genome assembly in the big data era
Background De novo genome assembly relies on two kinds of graphs: de Bruijn graphs and
overlap graphs. Overlap graphs are the basis for the Celera assembler, while de Bruijn …
Parallel string graph construction and transitive reduction for de novo genome assembly
One of the most computationally intensive tasks in computational biology is de novo genome
assembly, the decoding of the sequence of an unknown genome from redundant and …
Multithread multistring Burrows–Wheeler transform and longest common prefix array
Indexing huge collections of strings, such as those produced by the widespread sequencing
technologies, heavily relies on multistring generalizations of the Burrows–Wheeler transform …
GPU-accelerated large-scale genome assembly
Spurred by a widening gap between hardware accelerators and traditional processors,
numerous bioinformatics applications have harnessed the computing power of GPUs and …
Approximate suffix-prefix dictionary queries
W Zuba, G Loukides, SP Pissis… - … of Computer Science …, 2024 - drops.dagstuhl.de
In the all-pairs suffix-prefix (APSP) problem [Gusfield et al., Inf. Process. Lett. 1992], we are
given a dictionary R of r strings, S₁,…, S_r, of total length n, and we are asked to find the …
Can Formal Languages help Pangenomics to represent and analyze multiple genomes?
Graph pangenomics is a new emerging field in computational biology that is changing the
traditional view of a reference genome from a linear sequence to a new paradigm: a …
Suffix-prefix queries on a dictionary
In the all-pairs suffix-prefix (APSP) problem, we are given a dictionary R of k strings, S₁,…,
S_k, of total length n, and we are asked to find the length SPL_{i,j} of the longest string that is …
On the longest common prefix of suffixes in an inverse Lyndon factorization and other properties
The Lyndon factorization of a word has been largely studied and recently variants of it have
been introduced and investigated with different motivations. In particular, the canonical …
An analysis of automated software diversity using unstructured text analytics
AS Gearhart, PA Hamilton… - 2018 48th Annual IEEE …, 2018 - ieeexplore.ieee.org
Automated software diversity promises to reduce an attacker's ability to reuse exploits across
application instances. However, many questions remain regarding the efficacy of and …
Lyndon words versus inverse Lyndon words: Queries on suffixes and bordered words
The Lyndon factorization of a word has been extensively studied in different contexts and
several variants of it have been proposed. In particular, the canonical inverse Lyndon …