Dealing with missing values in large-scale studies: microarray data imputation and beyond
T Aittokallio - Briefings in bioinformatics, 2010 - academic.oup.com
High-throughput biotechnologies, such as gene expression microarrays or mass-
spectrometry-based proteomic assays, suffer from frequent missing values due to various …
spectrometry-based proteomic assays, suffer from frequent missing values due to various …
Robust methods for multivariate data analysis
SF Møller, J von Frese, R Bro - Journal of Chemometrics: A …, 2005 - Wiley Online Library
Outliers may hamper proper classical multivariate analysis, and lead to incorrect
conclusions. To remedy the problem of outliers, robust methods are developed in statistics …
conclusions. To remedy the problem of outliers, robust methods are developed in statistics …
The history of the cluster heat map
The cluster heat map is an ingenious display that simultaneously reveals row and column
hierarchical cluster structure in a data matrix. It consists of a rectangular tiling, with each tile …
hierarchical cluster structure in a data matrix. It consists of a rectangular tiling, with each tile …
Matrix reordering methods for table and network visualization
This survey provides a description of algorithms to reorder visual matrices of tabular data
and adjacency matrix of Networks. The goal of this survey is to provide a comprehensive list …
and adjacency matrix of Networks. The goal of this survey is to provide a comprehensive list …
Clustering cancer gene expression data: a comparative study
Background The use of clustering methods for the discovery of cancer subtypes has drawn a
great deal of attention in the scientific community. While bioinformaticians have proposed …
great deal of attention in the scientific community. While bioinformaticians have proposed …
Comparison of Affymetrix GeneChip expression measures
Abstract Motivation: In the Affymetrix GeneChip system, preprocessing occurs before one
obtains expression level measurements. Because the number of competing preprocessing …
obtains expression level measurements. Because the number of competing preprocessing …
A data-driven koopman approach for power system nonlinear dynamic observability analysis
A prerequisite to dynamic state estimation of a stochastic nonlinear dynamic model of a
power system is its observability analysis. However, due to the model nonlinearity, the …
power system is its observability analysis. However, due to the model nonlinearity, the …
Propagation of outliers in multivariate data
We investigate the performance of robust estimates of multivariate location under
nonstandard data contamination models such as componentwise outliers (ie, contamination …
nonstandard data contamination models such as componentwise outliers (ie, contamination …
[HTML][HTML] Large covariance estimation through elliptical factor models
We propose a general Principal Orthogonal complEment Thresholding (POET) framework
for large-scale covariance matrix estimation based on the approximate factor model. A set of …
for large-scale covariance matrix estimation based on the approximate factor model. A set of …
[書籍][B] Analysis of microarray gene expression data
MLT Lee - 2007 - books.google.com
After genomic sequencing, microarray technology has emerged as a widely used platform
for genomic studies in the life sciences. Microarray technology provides a systematic way to …
for genomic studies in the life sciences. Microarray technology provides a systematic way to …