Principal component analysis: A natural approach to data exploration

FL Gewers, GR Ferreira, HFD Arruda, FN Silva… - ACM Computing …, 2021 - dl.acm.org
Principal component analysis (PCA) is often applied for analyzing data in the most diverse
areas. This work reports, in an accessible and integrated manner, several theoretical and …

Integrative functional genomic analysis of human brain development and neuropsychiatric risks

M Li, G Santpere, Y Imamura Kawasawa, OV Evgrafov… - Science, 2018 - science.org
INTRODUCTION The brain is responsible for cognition, behavior, and much of what makes
us uniquely human. The development of the brain is a highly complex process, and this …

A general and flexible method for signal extraction from single-cell RNA-seq data

D Risso, F Perraudeau, S Gribkova, S Dudoit… - Nature …, 2018 - nature.com
Single-cell RNA-sequencing (scRNA-seq) is a powerful high-throughput technique that
enables researchers to measure genome-wide transcription levels at the resolution of single …

Applying artificial intelligence in the microbiome for gastrointestinal diseases: a review

T Zeng, X Yu, Z Chen - Journal of Gastroenterology and …, 2021 - Wiley Online Library
For a long time, gut bacteria have been recognized for their important roles in the
occurrence and progression of gastrointestinal diseases like colorectal cancer, and the ever …

Batch effect detection and correction in RNA-seq data using machine-learning-based automated assessment of quality

M Sprang, MA Andrade-Navarro, JF Fontaine - BMC bioinformatics, 2022 - Springer
Background The constant evolving and development of next-generation sequencing
techniques lead to high throughput data composed of datasets that include a large number …

Benchmarking principal component analysis for large-scale single-cell RNA-sequencing

K Tsuyuzaki, H Sato, K Sato, I Nikaido - Genome biology, 2020 - Springer
Background Principal component analysis (PCA) is an essential method for analyzing single-
cell RNA-seq (scRNA-seq) datasets, but for large-scale scRNA-seq datasets, computation …

RA3 is a reference-guided approach for epigenetic characterization of single cells

S Chen, G Yan, W Zhang, J Li, R Jiang, Z Lin - Nature Communications, 2021 - nature.com
The recent advancements in single-cell technologies, including single-cell chromatin
accessibility sequencing (scCAS), have enabled profiling the epigenetic landscapes for …

AC-PCoA: Adjustment for confounding factors using principal coordinate analysis

Y Wang, F Sun, W Lin, S Zhang - PLoS computational biology, 2022 - journals.plos.org
Confounding factors exist widely in various biological data owing to technical variations,
population structures and experimental conditions. Such factors may mask the true signals …

aPCoA: covariate adjusted principal coordinates analysis

Y Shi, L Zhang, KA Do, CB Peterson, RR Jenq - Bioinformatics, 2020 - academic.oup.com
In fields, such as ecology, microbiology and genomics, non-Euclidean distances are widely
applied to describe pairwise dissimilarity between samples. Given these pairwise distances …

scMC learns biological variation through the alignment of multiple single-cell genomics datasets

L Zhang, Q Nie - Genome biology, 2021 - Springer
Distinguishing biological from technical variation is crucial when integrating and comparing
single-cell genomics datasets across different experiments. Existing methods lack the …