[PDF][PDF] Computational pan-genomics: status, promises and challenges
Briefings in bioinformatics, 2018 - academic.oup.com
Many disciplines, from human genetics and oncology to plant breeding, microbiology and
virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes …
virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes …
A benchmark study of k-mer counting methods for high-throughput sequencing
SC Manekar, SR Sathe - GigaScience, 2018 - academic.oup.com
The rapid development of high-throughput sequencing technologies means that hundreds of
gigabytes of sequencing data can be produced in a single study. Many bioinformatics tools …
gigabytes of sequencing data can be produced in a single study. Many bioinformatics tools …
ABySS 2.0: resource-efficient assembly of large genomes using a Bloom filter
The assembly of DNA sequences de novo is fundamental to genomics research. It is the first
of many steps toward elucidating and characterizing whole genomes. Downstream …
of many steps toward elucidating and characterizing whole genomes. Downstream …
KMC 3: counting and manipulating k-mer statistics
Counting all k-mers in a given dataset is a standard procedure in many bioinformatics
applications. We introduce KMC3, a significant improvement of the former KMC2 algorithm …
applications. We introduce KMC3, a significant improvement of the former KMC2 algorithm …
Informed and automated k-mer size selection for genome assembly
Motivation: Genome assembly tools based on the de Bruijn graph framework rely on a
parameter k, which represents a trade-off between several competing effects that are difficult …
parameter k, which represents a trade-off between several competing effects that are difficult …
Assembly of long error-prone reads using de Bruijn graphs
The recent breakthroughs in assembling long error-prone reads were based on the overlap-
layout-consensus (OLC) approach and did not utilize the strengths of the alternative de …
layout-consensus (OLC) approach and did not utilize the strengths of the alternative de …
Genome-wide association studies of global Mycobacterium tuberculosis resistance to 13 antimicrobials in 10,228 genomes identify new resistance mechanisms
CRyPTIC Consortium - PLoS biology, 2022 - journals.plos.org
The emergence of drug-resistant tuberculosis is a major global public health concern that
threatens the ability to control the disease. Whole-genome sequencing as a tool to rapidly …
threatens the ability to control the disease. Whole-genome sequencing as a tool to rapidly …
Identifying lineage effects when controlling for population structure improves power in bacterial association studies
Bacteria pose unique challenges for genome-wide association studies because of strong
structuring into distinct strains and substantial linkage disequilibrium across the genome 1 …
structuring into distinct strains and substantial linkage disequilibrium across the genome 1 …
Space-efficient and exact de Bruijn graph representation based on a Bloom filter
Abstract Background The de Bruijn graph data structure is widely used in next-generation
sequencing (NGS). Many programs, eg de novo assemblers, rely on in-memory …
sequencing (NGS). Many programs, eg de novo assemblers, rely on in-memory …
KMC 2: fast and resource-frugal k-mer counting
Motivation: Building the histogram of occurrences of every k-symbol long substring of
nucleotide data is a standard step in many bioinformatics applications, known under the …
nucleotide data is a standard step in many bioinformatics applications, known under the …