Eleven grand challenges in single-cell data science

D Lähnemann, J Köster, E Szczurek, DJ McCarthy… - Genome biology, 2020 - Springer
The recent boom in microfluidics and combinatorial indexing strategies, combined with low
sequencing costs, has empowered single-cell sequencing technology. Thousands—or even …

Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers

L Wratten, A Wilm, J Göke - Nature methods, 2021 - nature.com
The rapid growth of high-throughput technologies has transformed biomedical research.
With the increasing amount and complexity of data, scalability and reproducibility have …

Benchmarking graph neural networks

VP Dwivedi, CK Joshi, AT Luu, T Laurent… - Journal of Machine …, 2023 - jmlr.org
In the last few years, graph neural networks (GNNs) have become the standard toolkit for
analyzing and learning from data on graphs. This emerging field has witnessed an extensive …

State of the field in multi-omics research: from computational needs to data mining and sharing

M Krassowski, V Das, SK Sahu, BB Misra - Frontiers in Genetics, 2020 - frontiersin.org
Multi-omics, variously called integrated omics, pan-omics, and trans-omics, aims to combine
two or more omics data sets to aid in data analysis, visualization and interpretation to …

DNA methylation-based predictors of health: applications and statistical considerations

PD Yousefi, M Suderman, R Langdon… - Nature Reviews …, 2022 - nature.com
DNA methylation data have become a valuable source of information for biomarker
development, because, unlike static genetic risk estimates, DNA methylation varies …

AltWOA: Altruistic Whale Optimization Algorithm for feature selection on microarray datasets

R Kundu, S Chattopadhyay, E Cuevas… - Computers in biology and …, 2022 - Elsevier
The data-driven modern era has enabled the collection of large amounts of biomedical and
clinical data. DNA microarray gene expression datasets have mainly gained significant …

A benchmark for RNA-seq deconvolution analysis under dynamic testing environments

H **, Z Liu - Genome biology, 2021 - Springer
Background Deconvolution analyses have been widely used to track compositional
alterations of cell types in gene expression data. Although a large number of novel methods …

Benchmarking computational doublet-detection methods for single-cell RNA sequencing data

NM **, JJ Li - Cell systems, 2021 - cell.com
In single-cell RNA sequencing (scRNA-seq), doublets form when two cells are encapsulated
into one reaction volume. The existence of doublets, which appear to be—but are not—real …

A compact vocabulary of paratope-epitope interactions enables predictability of antibody-antigen binding

R Akbar, PA Robert, M Pavlović, JR Jeliazkov… - Cell Reports, 2021 - cell.com
Antibody-antigen binding relies on the specific interaction of amino acids at the paratope-
epitope interface. The predictability of antibody-antigen binding is a prerequisite for de novo …

Spearheading future omics analyses using dyngen, a multi-modal simulator of single cells

R Cannoodt, W Saelens, L Deconinck… - Nature …, 2021 - nature.com
We present dyngen, a multi-modal simulation engine for studying dynamic cellular
processes at single-cell resolution. dyngen is more flexible than current single-cell …