When less is more: sketching with minimizers in genomics
The exponential increase in sequencing data calls for conceptual and computational
advances to extract useful biological insights. One such advance, minimizers, allows for …
advances to extract useful biological insights. One such advance, minimizers, allows for …
Unveiling microbial diversity: harnessing long-read sequencing technology
Long-read sequencing has recently transformed metagenomics, enhancing strain-level
pathogen characterization, enabling accurate and complete metagenome-assembled …
pathogen characterization, enabling accurate and complete metagenome-assembled …
Small polymorphisms are a source of ancestral bias in structural variant breakpoint placement
High-quality genome assemblies and sophisticated algorithms have increased sensitivity for
a wide range of variant types, and breakpoint accuracy for structural variants (SVs,≥ 50 bp) …
a wide range of variant types, and breakpoint accuracy for structural variants (SVs,≥ 50 bp) …
Proving sequence aligners can guarantee accuracy in almost O (m log n) time through an average-case analysis of the seed-chain-extend heuristic
Seed-chain-extend with k-mer seeds is a powerful heuristic technique for sequence
alignment used by modern sequence aligners. Although effective in practice for both runtime …
alignment used by modern sequence aligners. Although effective in practice for both runtime …
Block Aligner: an adaptive SIMD-accelerated aligner for sequences and position-specific scoring matrices
Motivation Efficiently aligning sequences is a fundamental problem in bioinformatics. Many
recent algorithms for computing alignments through Smith–Waterman–Gotoh dynamic …
recent algorithms for computing alignments through Smith–Waterman–Gotoh dynamic …
Sequence to graph alignment using gap-sensitive co-linear chaining
Co-linear chaining is a widely used technique in sequence alignment tools that follow seed-
filter-extend methodology. It is a mathematically rigorous approach to combine short exact …
filter-extend methodology. It is a mathematically rigorous approach to combine short exact …
Complete mitochondrial genome of Agropyron cristatum reveals gene transfer and RNA editing events
T Ou, Z Wu, C Tian, Y Yang, Z Li - BMC Plant Biology, 2024 - Springer
Background As an important forage in arid and semi-arid regions, Agropyron cristatum
provides livestock with exceptionally high nutritional value. Additionally, A. cristatum exhibits …
provides livestock with exceptionally high nutritional value. Additionally, A. cristatum exhibits …
ModDotPlot—rapid and interactive visualization of tandem repeats
Motivation A common method for analyzing genomic repeats is to produce a sequence
similarity matrix visualized via a dot plot. Innovative approaches such as StainedGlass have …
similarity matrix visualized via a dot plot. Innovative approaches such as StainedGlass have …
Entropy predicts sensitivity of pseudorandom seeds
Seed design is important for sequence similarity search applications such as read map**
and average nucleotide identity (ANI) estimation. Although k-mers and spaced k-mers are …
and average nucleotide identity (ANI) estimation. Although k-mers and spaced k-mers are …
An FPGA-based hardware accelerator supporting sensitive sequence homology filtering with profile hidden Markov models
Background Sequence alignment lies at the heart of genome sequence annotation. While
the BLAST suite of alignment tools has long held an important role in alignment-based …
the BLAST suite of alignment tools has long held an important role in alignment-based …