Learning relus via gradient descent
M Soltanolkotabi - Advances in neural information …, 2017 - proceedings.neurips.cc
In this paper we study the problem of learning Rectified Linear Units (ReLUs) which are
functions of the form $\vct {x}\mapsto\max (0,\langle\vct {w},\vct {x}\rangle) $ with $\vct …
functions of the form $\vct {x}\mapsto\max (0,\langle\vct {w},\vct {x}\rangle) $ with $\vct …
Linear neural network layers promote learning single-and multiple-index models
This paper explores the implicit bias of overparameterized neural networks of depth greater
than two layers. Our framework considers a family of networks of varying depths that all have …
than two layers. Our framework considers a family of networks of varying depths that all have …
Algebraic variety models for high-rank matrix completion
We consider a non-linear generalization of low-rank matrix completion to the case where the
data belongs to an algebraic variety, ie, each data point is a solution to a system of …
data belongs to an algebraic variety, ie, each data point is a solution to a system of …
Non-iterative recovery from nonlinear observations using generative models
In this paper, we aim to estimate the direction of an underlying signal from its nonlinear
observations following the semi-parametric single index model (SIM). Unlike for …
observations following the semi-parametric single index model (SIM). Unlike for …
Least squares estimation in the monotone single index model
F Balabdaoui, C Durot, H Jankowski - Bernoulli, 2019 - projecteuclid.org
We study the monotone single index model where a real response variable $ Y $ is linked to
a $ d $-dimensional covariate $ X $ through the relationship $ E [Y| X]=\Psi_ {0}(\alpha^{T} …
a $ d $-dimensional covariate $ X $ through the relationship $ E [Y| X]=\Psi_ {0}(\alpha^{T} …
Active learning for single neuron models with lipschitz non-linearities
We consider the problem of active learning for single neuron models, also sometimes called
“ridge functions”, in the agnostic setting (under adversarial label noise). Such models have …
“ridge functions”, in the agnostic setting (under adversarial label noise). Such models have …
Inference in high-dimensional single-index models under symmetric designs
The problem of statistical inference for regression coefficients in a high-dimensional single-
index model is considered. Under elliptical symmetry, the single index model can be …
index model is considered. Under elliptical symmetry, the single index model can be …
Semiparametric efficiency in convexity constrained single-index model
We consider estimation and inference in a single-index regression model with an unknown
convex link function. We introduce a convex and Lipschitz constrained least-square …
convex link function. We introduce a convex and Lipschitz constrained least-square …
Fast algorithms for demixing sparse signals from nonlinear observations
We study the problem of demixing a pair of sparse signals from noisy, nonlinear
observations of their superposition. Mathematically, we consider a nonlinear signal …
observations of their superposition. Mathematically, we consider a nonlinear signal …
Misspecified phase retrieval with generative priors
In this paper, we study phase retrieval under model misspecification and generative priors.
In particular, we aim to estimate an $ n $-dimensional signal $\mathbf {x} $ from $ m $ iid …
In particular, we aim to estimate an $ n $-dimensional signal $\mathbf {x} $ from $ m $ iid …