Interpretable and explainable machine learning: A methods‐centric overview with concrete examples

R Marcinkevičs, JE Vogt - Wiley Interdisciplinary Reviews: Data …, 2023 - Wiley Online Library
Interpretability and explainability are crucial for machine learning (ML) and statistical
applications in medicine, economics, law, and natural sciences and form an essential …

Interpretability and explainability: A machine learning zoo mini-tour

R Marcinkevičs, JE Vogt - arxiv preprint arxiv:2012.01805, 2020 - arxiv.org
In this review, we examine the problem of designing interpretable and explainable machine
learning models. Interpretability and explainability lie at the core of many machine learning …

Permutation-based identification of important biomarkers for complex diseases via machine learning models

X Mi, B Zou, F Zou, J Hu - Nature communications, 2021 - nature.com
Study of human disease remains challenging due to convoluted disease etiologies and
complex molecular mechanisms at genetic, genomic, and proteomic levels. Many machine …

A pan-tissue DNA-methylation epigenetic clock based on deep learning

LP de Lima Camillo, LR Lapierre, R Singh - npj Aging, 2022 - nature.com
Several age predictors based on DNA methylation, dubbed epigenetic clocks, have been
created in recent years, with the vast majority based on regularized linear regression. This …

Concrete autoencoders: Differentiable feature selection and reconstruction

MF Balın, A Abid, J Zou - International conference on …, 2019 - proceedings.mlr.press
We introduce the concrete autoencoder, an end-to-end differentiable method for global
feature selection, which efficiently identifies a subset of the most informative features and …

Epigenetic ageing clocks: statistical methods and emerging computational challenges

AE Teschendorff, S Horvath - Nature Reviews Genetics, 2025 - nature.com
Over the past decade, epigenetic clocks have emerged as powerful machine learning tools,
not only to estimate chronological and biological age but also to assess the efficacy of anti …

Deep knockoffs

Y Romano, M Sesia, E Candès - Journal of the American Statistical …, 2020 - Taylor & Francis
This article introduces a machine for sampling approximate model-X knockoffs for arbitrary
and unspecified data distributions using deep generative models. The main idea is to …

Computational frameworks integrating deep learning and statistical models in mining multimodal omics data

L Lac, CK Leung, P Hu - Journal of Biomedical Informatics, 2024 - Elsevier
Background In health research, multimodal omics data analysis is widely used to address
important clinical and biological questions. Traditional statistical methods rely on the strong …

High dimensional, tabular deep learning with an auxiliary knowledge graph

C Ruiz, H Ren, K Huang… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract Machine learning models exhibit strong performance on datasets with abundant
labeled samples. However, for tabular datasets with extremely high $ d $-dimensional …

A performance-driven benchmark for feature selection in tabular deep learning

V Cherepanova, R Levin, G Somepalli… - Advances in …, 2023 - proceedings.neurips.cc
Academic tabular benchmarks often contain small sets of curated features. In contrast, data
scientists typically collect as many features as possible into their datasets, and even …