Discovering causal relations and equations from data
Physics is a field of science that has traditionally used the scientific method to answer
questions about why natural phenomena occur and to make testable models that explain the …
questions about why natural phenomena occur and to make testable models that explain the …
Gaussian processes and kernel methods: A review on connections and equivalences
This paper is an attempt to bridge the conceptual gaps between researchers working on the
two widely used approaches based on positive definite kernels: Bayesian learning or …
two widely used approaches based on positive definite kernels: Bayesian learning or …
Do vision transformers see like convolutional neural networks?
Convolutional neural networks (CNNs) have so far been the de-facto model for visual data.
Recent work has shown that (Vision) Transformer models (ViT) can achieve comparable or …
Recent work has shown that (Vision) Transformer models (ViT) can achieve comparable or …
[BOOK][B] Elements of causal inference: foundations and learning algorithms
A concise and self-contained introduction to causal inference, increasingly important in data
science and machine learning. The mathematization of causality is a relatively recent …
science and machine learning. The mathematization of causality is a relatively recent …
f-gan: Training generative neural samplers using variational divergence minimization
Generative neural networks are probabilistic models that implement sampling using
feedforward neural networks: they take a random input vector and produce a sample from a …
feedforward neural networks: they take a random input vector and produce a sample from a …
[PDF][PDF] A kernel two-sample test
We propose a framework for analyzing and comparing distributions, which we use to
construct statistical tests to determine if two samples are drawn from different distributions …
construct statistical tests to determine if two samples are drawn from different distributions …
Deep stable learning for out-of-distribution generalization
Approaches based on deep neural networks have achieved striking performance when
testing data and training data share similar distribution, but can significantly fail otherwise …
testing data and training data share similar distribution, but can significantly fail otherwise …
A kernel method for the two-sample-problem
We propose two statistical tests to determine if two samples are from different distributions.
Our test statistic is in both cases the distance between the means of the two samples …
Our test statistic is in both cases the distance between the means of the two samples …
Consensus graph learning for multi-view clustering
Multi-view clustering, which exploits the multi-view information to partition data into their
clusters, has attracted intense attention. However, most existing methods directly learn a …
clusters, has attracted intense attention. However, most existing methods directly learn a …
One-for-all: Bridge the gap between heterogeneous architectures in knowledge distillation
Abstract Knowledge distillation (KD) has proven to be a highly effective approach for
enhancing model performance through a teacher-student training scheme. However, most …
enhancing model performance through a teacher-student training scheme. However, most …