A survey on unsupervised outlier detection in high‐dimensional numerical data
High‐dimensional data in Euclidean space pose special challenges to data mining
algorithms. These challenges are often indiscriminately subsumed under the term 'curse of …
algorithms. These challenges are often indiscriminately subsumed under the term 'curse of …
The dimensionality and structure of species trait spaces
Trait‐based ecology aims to understand the processes that generate the overarching
diversity of organismal traits and their influence on ecosystem functioning. Achieving this …
diversity of organismal traits and their influence on ecosystem functioning. Achieving this …
The art of using t-SNE for single-cell transcriptomics
Single-cell transcriptomics yields ever growing data sets containing RNA expression levels
for thousands of genes from up to millions of cells. Common data analysis pipelines include …
for thousands of genes from up to millions of cells. Common data analysis pipelines include …
Toward a quantitative survey of dimension reduction techniques
Dimensionality reduction methods, also known as projections, are frequently used in
multidimensional data exploration in machine learning, data science, and information …
multidimensional data exploration in machine learning, data science, and information …
Parametric UMAP embeddings for representation and semisupervised learning
UMAP is a nonparametric graph-based dimensionality reduction algorithm using applied
Riemannian geometry and algebraic topology to find low-dimensional embeddings of …
Riemannian geometry and algebraic topology to find low-dimensional embeddings of …
Visualizing high-dimensional data: Advances in the past decade
Massive simulations and arrays of sensing devices, in combination with increasing
computing resources, have generated large, complex, high-dimensional datasets used to …
computing resources, have generated large, complex, high-dimensional datasets used to …
Multidimensional projection for visual analytics: Linking techniques with distortions, tasks, and layout enrichment
Visual analysis of multidimensional data requires expressive and effective ways to reduce
data dimensionality to encode them visually. Multidimensional projections (MDP) figure …
data dimensionality to encode them visually. Multidimensional projections (MDP) figure …
Topological autoencoders
We propose a novel approach for preserving topological structures of the input space in
latent representations of autoencoders. Using persistent homology, a technique from …
latent representations of autoencoders. Using persistent homology, a technique from …
t-visne: Interactive assessment and interpretation of t-sne projections
t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of
multidimensional data has proven to be a popular approach, with successful applications in …
multidimensional data has proven to be a popular approach, with successful applications in …
[PDF][PDF] Making machine learning models interpretable.
Data of different levels of complexity and of ever growing diversity of characteristics are the
raw materials that machine learning practitioners try to model using their wide palette of …
raw materials that machine learning practitioners try to model using their wide palette of …