Automatic feature selection and weighting in molecular systems using Differentiable Information Imbalance
Feature selection is essential in the analysis of molecular systems and many other fields, but
several uncertainties remain: What is the optimal number of features for a simplified …
several uncertainties remain: What is the optimal number of features for a simplified …
The generalized ratios intrinsic dimension estimator
Modern datasets are characterized by numerous features related by complex dependency
structures. To deal with these data, dimensionality reduction techniques are essential. Many …
structures. To deal with these data, dimensionality reduction techniques are essential. Many …
Coarse-grained molecular dynamics with normalizing flows
We propose a sampling algorithm relying on a collective variable (CV) of midsize dimension
modeled by a normalizing flow and using nonequilibrium dynamics to propose full …
modeled by a normalizing flow and using nonequilibrium dynamics to propose full …
Intrinsic dimension as a multi-scale summary statistics in network modeling
Complex networks are powerful mathematical tools for modelling and understanding the
behaviour of highly interconnected systems. However, existing methods for analyzing these …
behaviour of highly interconnected systems. However, existing methods for analyzing these …
Intrinsic dimension estimation for discrete metrics
Real-world datasets characterized by discrete features are ubiquitous: from categorical
surveys to clinical questionnaires, from unweighted networks to DNA sequences …
surveys to clinical questionnaires, from unweighted networks to DNA sequences …
Investigating the price determinants of the European Emission Trading System: a non-parametric approach
Understanding the intricacies of factors influencing European Union Emission Trading
System (EU ETS) market prices is paramount for effective policy making and strategy …
System (EU ETS) market prices is paramount for effective policy making and strategy …
Coarse-Graining and Forecasting Atomic Material Simulations with Descriptors
TD Swinburne - Physical Review Letters, 2023 - APS
Atomic simulations of materials require significant resources to generate, store, and analyze.
Here, descriptor functions are proposed as a general, metric latent space for atomic …
Here, descriptor functions are proposed as a general, metric latent space for atomic …
Maximally informative feature selection using Information Imbalance: Application to COVID-19 severity prediction
Clinical databases typically include, for each patient, many heterogeneous features, for
example blood exams, the clinical history before the onset of the disease, the evolution of …
example blood exams, the clinical history before the onset of the disease, the evolution of …
How complex are galaxies? A non-parametric estimation of the intrinsic dimensionality of wide-band photometric data
Galaxies are complex objects, yet the number of independent parameters to describe them
remains unknown. We present here a non-parametric method to estimate the intrinsic …
remains unknown. We present here a non-parametric method to estimate the intrinsic …
Robust inference of causality in high-dimensional dynamical processes from the Information Imbalance of distance ranks
We introduce an approach which allows detecting causal relationships between variables
for which the time evolution is available. Causality is assessed by a variational scheme …
for which the time evolution is available. Causality is assessed by a variational scheme …