A survey on distribution testing: Your data is big. But is it blue?

CL Canonne - Theory of Computing, 2020 - theoryofcomputing.org
The field of property testing originated in work on program checking, and has evolved into
an established and very active research area. In this work, we survey the developments of …

[KSIĄŻKA][B] Introduction to property testing

O Goldreich - 2017 - books.google.com
Property testing is concerned with the design of super-fast algorithms for the structural
analysis of large quantities of data. The aim is to unveil global features of the data, such as …

GANplifying event samples

A Butter, S Diefenbacher, G Kasieczka, B Nachman… - SciPost Physics, 2021 - scipost.org
A critical question concerning generative networks applied to event generation in particle
physics is if the generated events add statistical precision beyond the training sample. We …

Tolerant algorithms for learning with arbitrary covariate shift

S Goel, A Shetty, K Stavropoulos… - Advances in Neural …, 2025 - proceedings.neurips.cc
We study the problem of learning under arbitrary distribution shift, where the learner is
trained on a labeled set from one distribution but evaluated on a different, potentially …

Perfect sampling from pairwise comparisons

D Fotakis, A Kalavasis… - Advances in Neural …, 2022 - proceedings.neurips.cc
In this work, we study how to efficiently obtain perfect samples from a discrete distribution
$\mathcal {D} $ given access only to pairwise comparisons of elements of its support …

Sample amplification: Increasing dataset size even when learning is impossible

B Axelrod, S Garg, V Sharan… - … Conference on Machine …, 2020 - proceedings.mlr.press
Given data drawn from an unknown distribution, D, to what extent is it possible to “amplify”
this dataset and faithfully output an even larger set of samples that appear to have been …

[PDF][PDF] A survey on distribution testing

CL Canonne - Your Data is Big. But is it Blue, 2017 - cs.columbia.edu
The field of property testing originated in work on program checking, and has evolved into
an established and very active research area. In this work, we survey the developments of …

[KSIĄŻKA][B] Nature of Learning and Learning of Nature

S Garg - 2023 - search.proquest.com
This thesis explores questions surrounding the foundations of intelligence, both artificial and
natural. The first part focuses on the algorithmic and statistical underpinnings of modern …

Black-box methods for restoring monotonicity

E Gergatsouli, B Lucier… - … Conference on Machine …, 2020 - proceedings.mlr.press
In many practical applications, heuristic or approximation algorithms are used to efficiently
solve the task at hand. However their solutions frequently do not satisfy natural monotonicity …

Certified computation from unreliable datasets

T Gouleakis, C Tzamos… - Conference On Learning …, 2018 - proceedings.mlr.press
A wide range of learning tasks require human input in labeling massive data. The collected
data though are usually low quality and contain inaccuracies and errors. As a result, modern …