Verifying the union of manifolds hypothesis for image data

BCA Brown, AL Caterini, BL Ross, JC Cresswell… - arxiv preprint arxiv …, 2022 - arxiv.org
Deep learning has had tremendous success at learning low-dimensional representations of
high-dimensional data. This success would be impossible if there was no hidden low …

A geometric framework for understanding memorization in generative models

BL Ross, H Kamkari, T Wu, R Hosseinzadeh… - arxiv preprint arxiv …, 2024 - arxiv.org
As deep generative models have progressed, recent work has shown them to be capable of
memorizing and reproducing training datapoints when deployed. These findings call into …

Calochallenge 2022: A community challenge for fast calorimeter simulation

C Krause, MF Giannelli, G Kasieczka… - arxiv preprint arxiv …, 2024 - arxiv.org
We present the results of the" Fast Calorimeter Simulation Challenge 2022"-the
CaloChallenge. We study state-of-the-art generative models on four calorimeter shower …

Exploring Universal Intrinsic Task Subspace for Few-Shot Learning via Prompt Tuning

Y Qin, X Wang, Y Su, Y Lin, N Ding, J Yi… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Why can pre-trained language models (PLMs) learn universal representations and
effectively adapt to broad NLP tasks differing a lot superficially? In this work, we empirically …

Intrinsic dimensionality estimation using normalizing flows

C Horvat, JP Pfister - Advances in Neural Information …, 2022 - proceedings.neurips.cc
How many degrees of freedom are there in a dataset consisting of $ M $ samples embedded
in $\mathbb {R}^ D $? This number, formally known as\textsl {intrinsic dimensionality}, can …

One-line-of-code data mollification improves optimization of likelihood-based generative models

BH Tran, G Franzese, P Michiardi… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract Generative Models (GMs) have attracted considerable attention due to their
tremendous success in various domains, such as computer vision where they are capable to …

Dimensionality-Aware Outlier Detection

A Anderberg, J Bailey, RJGB Campello, ME Houle… - Proceedings of the 2024 …, 2024 - SIAM
We present a nonparametric method for outlier detection that takes full account of local
variations in intrinsic dimensionality within the dataset. Using the theory of Local Intrinsic …

Local intrinsic dimensionality, entropy and statistical divergences

J Bailey, ME Houle, X Ma - Entropy, 2022 - mdpi.com
Properties of data distributions can be assessed at both global and local scales. At a highly
localized scale, a fundamental measure is the local intrinsic dimensionality (LID), which …

Denoising deep generative models

G Loaiza-Ganem, BL Ross, L Wu… - Proceedings …, 2023 - proceedings.mlr.press
Likelihood-based deep generative models have recently been shown to exhibit pathological
behaviour under the manifold hypothesis as a consequence of using high-dimensional …

Impact of dataset size and long-term ECoG-based BCI usage on deep learning decoders performance

M Śliwowski, M Martin, A Souloumiac… - Frontiers in Human …, 2023 - frontiersin.org
Introduction In brain-computer interfaces (BCI) research, recording data is time-consuming
and expensive, which limits access to big datasets. This may influence the BCI system …