Transforming big data into smart data: An insight on the use of the k‐nearest neighbors algorithm to obtain quality data

I Triguero, D García‐Gil, J Maillo… - … : Data Mining and …, 2019 - Wiley Online Library
The k‐nearest neighbors algorithm is characterized as a simple yet effective data mining
technique. The main drawback of this technique appears when massive amounts of data …

Considerations of nano-QSAR/QSPR models for nanopesticide risk assessment within the European legislative framework

JJ Villaverde, B Sevilla-Morán, C López-Goti… - Science of the Total …, 2018 - Elsevier
The European market for pesticides is currently legislated through the well-developed
Regulation (EC) No. 1107/2009. This regulation promotes the competitiveness of European …

[KNJIGA][B] Learning theory from first principles

F Bach - 2024 - books.google.com
A comprehensive and cutting-edge introduction to the foundations and modern applications
of learning theory. Research has exploded in the field of machine learning resulting in …

Explaining the success of nearest neighbor methods in prediction

GH Chen, D Shah - Foundations and Trends® in Machine …, 2018 - nowpublishers.com
Many modern methods for prediction leverage nearest neighbor search to find past training
examples most similar to a test example, an idea that dates back in text to at least the 11th …

Estimating mutual information for discrete-continuous mixtures

W Gao, S Kannan, S Oh… - Advances in neural …, 2017 - proceedings.neurips.cc
Estimation of mutual information from observed samples is a basic primitive in machine
learning, useful in several learning tasks including correlation mining, information …

Background modeling for double Higgs boson production: Density ratios and optimal transport

T Manole, P Bryant, J Alison, M Kuusela… - The Annals of Applied …, 2024 - projecteuclid.org
This supplementary material consists of Appendix A, containing a section-by-section
summary of this manuscript in nontechnical language, Appendices B–D, containing …

Adaptive transfer learning

HWJ Reeve, TI Cannings, RJ Samworth - The Annals of Statistics, 2021 - JSTOR
In transfer learning, we wish to make inference about a target population when we have
access to data both from the distribution itself, and from a different but related source …

Demystifying Fixed -Nearest Neighbor Information Estimators

W Gao, S Oh, P Viswanath - IEEE Transactions on Information …, 2018 - ieeexplore.ieee.org
Estimating mutual information from independent identically distributed samples drawn from
an unknown joint density function is a basic statistical problem of broad interest with …

Efficient multivariate entropy estimation via -nearest neighbour distances

TB Berrett, RJ Samworth, M Yuan - 2019 - projecteuclid.org
Efficient multivariate entropy estimation via k-nearest neighbour distances Page 1 The Annals of
Statistics 2019, Vol. 47, No. 1, 288–318 https://doi.org/10.1214/18-AOS1688 © Institute of …

Stochastic optimization forests

N Kallus, X Mao - Management Science, 2023 - pubsonline.informs.org
We study contextual stochastic optimization problems, where we leverage rich auxiliary
observations (eg, product characteristics) to improve decision making with uncertain …