Automatic feature selection and weighting in molecular systems using Differentiable Information Imbalance

R Wild, F Wodaczek, V Del Tatto, B Cheng… - Nature …, 2025 - nature.com
Feature selection is essential in the analysis of molecular systems and many other fields, but
several uncertainties remain: What is the optimal number of features for a simplified …

The generalized ratios intrinsic dimension estimator

F Denti, D Doimo, A Laio, A Mira - Scientific Reports, 2022 - nature.com
Modern datasets are characterized by numerous features related by complex dependency
structures. To deal with these data, dimensionality reduction techniques are essential. Many …

Coarse-grained molecular dynamics with normalizing flows

S Tamagnone, A Laio, M Gabrié - Journal of Chemical Theory and …, 2024 - ACS Publications
We propose a sampling algorithm relying on a collective variable (CV) of midsize dimension
modeled by a normalizing flow and using nonequilibrium dynamics to propose full …

Intrinsic dimension as a multi-scale summary statistics in network modeling

I Macocco, A Mira, A Laio - Scientific Reports, 2024 - nature.com
Complex networks are powerful mathematical tools for modelling and understanding the
behaviour of highly interconnected systems. However, existing methods for analyzing these …

Intrinsic dimension estimation for discrete metrics

I Macocco, A Glielmo, J Grilli, A Laio - Physical Review Letters, 2023 - APS
Real-world datasets characterized by discrete features are ubiquitous: from categorical
surveys to clinical questionnaires, from unweighted networks to DNA sequences …

Investigating the price determinants of the European Emission Trading System: a non-parametric approach

C Salvagnin, A Glielmo, ME De Giuli, A Mira - Quantitative Finance, 2024 - Taylor & Francis
Understanding the intricacies of factors influencing European Union Emission Trading
System (EU ETS) market prices is paramount for effective policy making and strategy …

Coarse-Graining and Forecasting Atomic Material Simulations with Descriptors

TD Swinburne - Physical Review Letters, 2023 - APS
Atomic simulations of materials require significant resources to generate, store, and analyze.
Here, descriptor functions are proposed as a general, metric latent space for atomic …

Maximally informative feature selection using Information Imbalance: Application to COVID-19 severity prediction

R Wild, E Sozio, RG Margiotta, F Dellai… - Scientific Reports, 2024 - nature.com
Clinical databases typically include, for each patient, many heterogeneous features, for
example blood exams, the clinical history before the onset of the disease, the evolution of …

How complex are galaxies? A non-parametric estimation of the intrinsic dimensionality of wide-band photometric data

C Cadiou, C Laigle, O Agertz - Monthly Notices of the Royal …, 2025 - academic.oup.com
Galaxies are complex objects, yet the number of independent parameters to describe them
remains unknown. We present here a non-parametric method to estimate the intrinsic …

Robust inference of causality in high-dimensional dynamical processes from the Information Imbalance of distance ranks

V Del Tatto, G Fortunato, D Bueti, A Laio - PROCEEDINGS OF THE …, 2024 - iris.sissa.it
We introduce an approach which allows detecting causal relationships between variables
for which the time evolution is available. Causality is assessed by a variational scheme …