Multiple genome alignment in the telomere-to-telomere assembly era
With the arrival of telomere-to-telomere (T2T) assemblies of the human genome comes the
computational challenge of efficiently and accurately constructing multiple genome …
Opportunities and challenges of data-driven virus discovery
C Lauber, S Seitz - Biomolecules, 2022 - mdpi.com
Virus discovery has been fueled by new technologies ever since the first viruses were
discovered at the end of the 19th century. Starting with mechanical devices that provided …
Ensembl Genomes 2022: an expanding genome resource for non-vertebrates
AD Yates, J Allen, RM Amode, AG Azov… - Nucleic acids …, 2022 - academic.oup.com
Abstract: Ensembl Genomes (https://www.ensemblgenomes.org) provides access to non-
vertebrate genomes and analysis complementing vertebrate resources developed by the …
Minimizer-space de Bruijn graphs: Whole-genome assembly of long reads in minutes on a personal computer
DNA sequencing data continue to progress toward longer reads with increasingly lower
sequencing error rates. Here, we define an algorithmic approach, mdBG, that makes use of …
Genomic epidemiology reveals multidrug resistant plasmid spread between Vibrio cholerae lineages in Yemen
F Lassalle, S Al-Shalali, M Al-Hakimi, E Njamkepo… - Nature …, 2023 - nature.com
Since 2016, Yemen has been experiencing the largest cholera outbreak in modern history.
Multidrug resistance (MDR) emerged among Vibrio cholerae isolates from cholera patients …
Themisto: a scalable colored k-mer index for sensitive pseudoalignment against hundreds of thousands of bacterial genomes
Motivation: Huge datasets containing whole-genome sequences of bacterial strains are now
commonplace and represent a rich and important resource for modern genomic …
Extremely fast construction and querying of compacted and colored de Bruijn graphs with GGCAT
Compacted de Bruijn graphs are one of the most fundamental data structures in
computational genomics. Colored compacted de Bruijn graphs are a variant built on a …
Fulgor: a fast and compact k-mer index for large-scale matching and color queries
The problem of sequence identification or matching—determining the subset of reference
sequences from a given collection that are likely to contain a short, queried nucleotide …
Accurate and fast graph-based pangenome annotation and clustering with ggCaller
Bacterial genomes differ in both gene content and sequence mutations, which underlie
extensive phenotypic diversity, including variation in susceptibility to antimicrobials or …
Scalable, ultra-fast, and low-memory construction of compacted de Bruijn graphs with Cuttlefish 2
The de Bruijn graph is a key data structure in modern computational genomics, and
construction of its compacted variant resides upstream of many genomic analyses. As the …