progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement

AE Darling, B Mau, NT Perna - PLoS ONE, 2010 - journals.plos.org
Background Multiple genome alignment remains a challenging problem. Effects of
recombination including rearrangement, segmental duplication, gain, and loss can create a …

Effective sequence similarity detection with strobemers

K Sahlin - Genome research, 2021 - genome.cshlp.org
k-mer-based methods are widely used in bioinformatics for various types of sequence
comparisons. However, a single mutation will mutate k consecutive k-mers and make most k …

Exact string matching algorithms: survey, issues, and future research directions

SI Hakak, A Kamsin, P Shivakumara, GA Gilkar… - IEEE …, 2019 - ieeexplore.ieee.org
String matching has been an extensively studied research domain in the past two decades
due to its various applications in the fields of text, image, signal, and speech processing. As …

ZOOM! Zillions of oligos mapped

H Lin, Z Zhang, MQ Zhang, B Ma, M Li - Bioinformatics, 2008 - academic.oup.com
Motivation: The next generation sequencing technologies are generating billions of short
reads daily. Resequencing and personalized medicine need much faster software to map …

A review on sequence alignment algorithms for short reads based on next-generation sequencing

J Kim, M Ji, G Yi - IEEE Access, 2020 - ieeexplore.ieee.org
With recent advances in next-generation sequencing (NGS) technology, large volumes of
data have been produced in the form of short reads. Sequence assembly involves using …

PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds

Y Chen, T Souaiaia, T Chen - Bioinformatics, 2009 - academic.oup.com
Motivation: The explosion of next-generation sequencing data has spawned the design of
new algorithms and software tools to provide efficient mapping for different read lengths and …

Incorporating sequence quality data into alignment improves DNA read mapping

MC Frith, R Wan, P Horton - Nucleic acids research, 2010 - academic.oup.com
New DNA sequencing technologies have achieved breakthroughs in throughput, at the
expense of higher error rates. The primary way of interpreting biological sequences is via …

Locality-sensitive hashing without false negatives

R Pagh - Proceedings of the twenty-seventh annual ACM-SIAM …, 2016 - SIAM
We consider a new construction of locality-sensitive hash functions for Hamming space that
is covering in the sense that it is guaranteed to produce a collision for every pair of vectors …

A unifying framework for seed sensitivity and its application to subset seeds

G Kucherov, L Noé, M Roytberg - Journal of bioinformatics and …, 2006 - World Scientific
We propose a general approach to compute the seed sensitivity, that can be applied to
different definitions of seeds. It treats separately three components of the seed sensitivity …

[BOOK] Sequence comparison: theory and methods

KM Chao, L Zhang - 2008 - books.google.com
Biomolecular sequence comparison is the origin of bioinformatics. Today, powerful
sequence comparison methods, together with comprehensive biological databases, have …