Density-ratio matching under the bregman divergence: a unified framework of density-ratio estimation

M Sugiyama, T Suzuki, T Kanamori - Annals of the Institute of Statistical …, 2012 - Springer
Estimation of the ratio of probability densities has attracted a great deal of attention since it
can be used for addressing various statistical paradigms. A naive approach to density-ratio …

Learning from positive and unlabeled data: A survey

J Bekker, J Davis - Machine Learning, 2020 - Springer
Learning from positive and unlabeled data or PU learning is the setting where a learner only
has access to positive examples and unlabeled data. The assumption is that the unlabeled …

DC programming and DCA: thirty years of developments

HA Le Thi, T Pham Dinh - Mathematical Programming, 2018 - Springer
The year 2015 marks the 30th birthday of DC (Difference of Convex functions) programming
and DCA (DC Algorithms) which constitute the backbone of nonconvex programming and …

Convex formulation for learning from positive and unlabeled data

M Du Plessis, G Niu… - … conference on machine …, 2015 - proceedings.mlr.press
We discuss binary classification from only from positive and unlabeled data (PU
classification), which is conceivable in various real-world machine learning problems. Since …

[BOK][B] Machine learning in non-stationary environments: Introduction to covariate shift adaptation

M Sugiyama, M Kawanabe - 2012 - books.google.com
Theory, algorithms, and applications of machine learning techniques to overcome “covariate
shift” non-stationarity. As the power of computing has grown over the past few decades, the …

Estimating divergence functionals and the likelihood ratio by convex risk minimization

XL Nguyen, MJ Wainwright… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
We develop and analyze M-estimation methods for divergence functionals and the
likelihood ratios of two probability distributions. Our method is based on a nonasymptotic …

Scalable kernel methods via doubly stochastic gradients

B Dai, B **e, N He, Y Liang, A Raj… - Advances in neural …, 2014 - proceedings.neurips.cc
The general perception is that kernel methods are not scalable, so neural nets become the
choice for large-scale nonlinear learning problems. Have we tried hard enough for kernel …

[PDF][PDF] Semi-supervised novelty detection

G Blanchard, G Lee, C Scott - The Journal of Machine Learning Research, 2010 - jmlr.org
A common setting for novelty detection assumes that labeled examples from the nominal
class are available, but that labeled examples of novelties are unavailable. The standard …

One-class classification with gaussian processes

M Kemmler, E Rodner, ES Wacker, J Denzler - Pattern recognition, 2013 - Elsevier
Detecting instances of unknown categories is an important task for a multitude of problems
such as object recognition, event detection, and defect localization. This article investigates …

Statistical outlier detection using direct density ratio estimation

S Hido, Y Tsuboi, H Kashima, M Sugiyama… - … and information systems, 2011 - Springer
We propose a new statistical approach to the problem of inlier-based outlier detection, ie,
finding outliers in the test set based on the training set consisting only of inliers. Our key idea …