Alignment-free sequence comparison: benefits, applications, and tools

A Zielezinski, S Vinga, J Almeida, WM Karlowski - Genome biology, 2017 - Springer
Alignment-free sequence analyses have been applied to problems ranging from whole-
genome phylogeny to the classification of protein families, identification of horizontally …

De novo genome assembly and in natura epigenomics reveal salinity‐induced DNA methylation in the mangrove tree Bruguiera gymnorhiza

M Miryeganeh, F Marlétaz, D Gavriouchkina… - New …, 2022 - Wiley Online Library
Mangroves are adapted to harsh environments, such as high ultraviolet (UV) light, low
nutrition, and fluctuating salinity in coastal zones. However, little is known about the …

Fast and accurate phylogeny reconstruction using filtered spaced-word matches

CA Leimeister, S Sohrabi-Jahromi… - …, 2017 - academic.oup.com
Motivation Word-based or 'alignment-free'algorithms are increasingly used for phylogeny
reconstruction and genome comparison, since they are much faster than traditional …

Alignment-free method for DNA sequence clustering using Fuzzy integral similarity

AK Saw, G Raj, M Das, NC Talukdar, BC Tripathy… - Scientific reports, 2019 - nature.com
A larger amount of sequence data in private and public databases produced by next-
generation sequencing put new challenges due to limitation associated with the alignment …

A greedy alignment-free distance estimator for phylogenetic inference

SV Thankachan, SP Chockalingam, Y Liu… - BMC …, 2017 - Springer
Background Alignment-free sequence comparison approaches have been garnering
increasing interest in various data-and compute-intensive applications such as phylogenetic …

rasbhari: Optimizing Spaced Seeds for Database Searching, Read Map** and Alignment-Free Sequence Comparison

L Hahn, CA Leimeister, R Ounit, S Lonardi… - PLoS computational …, 2016 - journals.plos.org
Many algorithms for sequence analysis rely on word matching or word statistics. Often, these
approaches can be improved if binary patterns representing match and don't-care positions …

Prot-SpaM: fast alignment-free phylogeny reconstruction based on whole-proteome sequences

CA Leimeister, J Schellhorn, S Dörrer, M Gerth… - …, 2019 - academic.oup.com
Word-based or 'alignment-free'sequence comparison has become an active research area
in bioinformatics. While previous word-frequency approaches calculated rough measures of …

Longest common factor after one edit operation

A Amir, P Charalampopoulos, CS Iliopoulos… - … Symposium on String …, 2017 - Springer
It is well known that the longest common factor (LCF) of two strings over an integer alphabet
can be computed in time linear in the total length of the two strings. Our aim here is to …

Algorithmic framework for approximate matching under bounded edits with applications to sequence analysis

SV Thankachan, C Aluru, SP Chockalingam… - … Molecular Biology: 22nd …, 2018 - Springer
We present a novel algorithmic framework for solving approximate sequence matching
problems that permit a bounded total number k of mismatches, insertions, and deletions. The …

Linear-Time Algorithm for Long LCF with Mismatches

P Charalampopoulos, M Crochemore… - arxiv preprint arxiv …, 2018 - arxiv.org
In the Longest Common Factor with $ k $ Mismatches (LCF $ _k $) problem, we are given
two strings $ X $ and $ Y $ of total length $ n $, and we are asked to find a pair of maximal …