A survey on unsupervised outlier detection in high‐dimensional numerical data

A Zimek, E Schubert, HP Kriegel - Statistical Analysis and Data …, 2012 - Wiley Online Library
High‐dimensional data in Euclidean space pose special challenges to data mining
algorithms. These challenges are often indiscriminately subsumed under the term 'curse of …

The dimensionality and structure of species trait spaces

D Mouillot, N Loiseau, M Grenié, AC Algar… - Ecology …, 2021 - Wiley Online Library
Trait‐based ecology aims to understand the processes that generate the overarching
diversity of organismal traits and their influence on ecosystem functioning. Achieving this …

The art of using t-SNE for single-cell transcriptomics

D Kobak, P Berens - Nature communications, 2019 - nature.com
Single-cell transcriptomics yields ever growing data sets containing RNA expression levels
for thousands of genes from up to millions of cells. Common data analysis pipelines include …

Toward a quantitative survey of dimension reduction techniques

M Espadoto, RM Martins, A Kerren… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Dimensionality reduction methods, also known as projections, are frequently used in
multidimensional data exploration in machine learning, data science, and information …

Parametric UMAP embeddings for representation and semisupervised learning

T Sainburg, L McInnes, TQ Gentner - Neural Computation, 2021 - direct.mit.edu
UMAP is a nonparametric graph-based dimensionality reduction algorithm using applied
Riemannian geometry and algebraic topology to find low-dimensional embeddings of …

Visualizing high-dimensional data: Advances in the past decade

S Liu, D Maljovec, B Wang, PT Bremer… - IEEE transactions on …, 2016 - ieeexplore.ieee.org
Massive simulations and arrays of sensing devices, in combination with increasing
computing resources, have generated large, complex, high-dimensional datasets used to …

Multidimensional projection for visual analytics: Linking techniques with distortions, tasks, and layout enrichment

LG Nonato, M Aupetit - IEEE Transactions on Visualization and …, 2018 - ieeexplore.ieee.org
Visual analysis of multidimensional data requires expressive and effective ways to reduce
data dimensionality to encode them visually. Multidimensional projections (MDP) figure …

Topological autoencoders

M Moor, M Horn, B Rieck… - … conference on machine …, 2020 - proceedings.mlr.press
We propose a novel approach for preserving topological structures of the input space in
latent representations of autoencoders. Using persistent homology, a technique from …

t-visne: Interactive assessment and interpretation of t-sne projections

A Chatzimparmpas, RM Martins… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of
multidimensional data has proven to be a popular approach, with successful applications in …

[PDF][PDF] Making machine learning models interpretable.

A Vellido, JD Martín-Guerrero, PJG Lisboa - ESANN, 2012 - Citeseer
Data of different levels of complexity and of ever growing diversity of characteristics are the
raw materials that machine learning practitioners try to model using their wide palette of …