progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement
Background: Multiple genome alignment remains a challenging problem. Effects of
recombination including rearrangement, segmental duplication, gain, and loss can create a …
Effective sequence similarity detection with strobemers
K Sahlin - Genome research, 2021 - genome.cshlp.org
k-mer-based methods are widely used in bioinformatics for various types of sequence
comparisons. However, a single mutation will mutate k consecutive k-mers and make most k …
Exact string matching algorithms: survey, issues, and future research directions
String matching has been an extensively studied research domain in the past two decades
due to its various applications in the fields of text, image, signal, and speech processing. As …
ZOOM! Zillions of oligos mapped
Motivation: The next generation sequencing technologies are generating billions of short
reads daily. Resequencing and personalized medicine need much faster software to map …
A review on sequence alignment algorithms for short reads based on next-generation sequencing
With recent advances in next-generation sequencing (NGS) technology, large volumes of
data have been produced in the form of short reads. Sequence assembly involves using …
PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds
Motivation: The explosion of next-generation sequencing data has spawned the design of
new algorithms and software tools to provide efficient mapping for different read lengths and …
Incorporating sequence quality data into alignment improves DNA read mapping
New DNA sequencing technologies have achieved breakthroughs in throughput, at the
expense of higher error rates. The primary way of interpreting biological sequences is via …
Locality-sensitive hashing without false negatives
R Pagh - Proceedings of the twenty-seventh annual ACM-SIAM …, 2016 - SIAM
We consider a new construction of locality-sensitive hash functions for Hamming space that
is covering in the sense that it is guaranteed to produce a collision for every pair of vectors …
A unifying framework for seed sensitivity and its application to subset seeds
We propose a general approach to compute the seed sensitivity, that can be applied to
different definitions of seeds. It treats separately three components of the seed sensitivity …
[BOOK][B] Sequence comparison: theory and methods
Biomolecular sequence comparison is the origin of bioinformatics. Today, powerful
sequence comparison methods, together with comprehensive biological databases, have …