Relation extraction using distant supervision: A survey

A Smirnova, P Cudré-Mauroux - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
Relation extraction is a subtask of information extraction where semantic relationships are
extracted from natural language text and then classified. In essence, it allows us to acquire …

Snorkel: Rapid training data creation with weak supervision

A Ratner, SH Bach, H Ehrenberg… - Proceedings of the …, 2017 - pmc.ncbi.nlm.nih.gov
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the …

Snorkel: rapid training data creation with weak supervision

A Ratner, SH Bach, H Ehrenberg, J Fries, S Wu, C Ré - The VLDB Journal, 2020 - Springer
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the …

Data programming: Creating large training sets, quickly

AJ Ratner, CM De Sa, S Wu… - Advances in neural …, 2016 - proceedings.neurips.cc
Large labeled training sets are the critical building blocks of supervised learning methods
and are key enablers of deep learning techniques. For some applications, creating labeled …

Snuba: Automating weak supervision to label training data

P Varma, C Ré - … . International Conference on Very Large Data …, 2018 - pmc.ncbi.nlm.nih.gov
As deep learning models are applied to increasingly diverse problems, a key bottleneck is
gathering enough high-quality training labels tailored to each task. Users therefore turn to …

Knowledge graphs: An information retrieval perspective

R Reinanda, E Meij, M de Rijke - Foundations and Trends® …, 2020 - nowpublishers.com
In this survey, we provide an overview of the literature on knowledge graphs (KGs) in the
context of information retrieval (IR). Modern IR systems can benefit from information …

Training classifiers with natural language explanations

B Hancock, M Bringmann, P Varma… - Proceedings of the …, 2018 - pmc.ncbi.nlm.nih.gov
Training accurate classifiers requires many labels, but each label provides only limited
information (one bit for binary classification). In this work, we propose BabbleLabble, a …

Learning the structure of generative models without labeled data

SH Bach, B He, A Ratner, C Ré - … Conference on Machine …, 2017 - proceedings.mlr.press
Curating labeled training data has become the primary bottleneck in machine learning.
Recent frameworks address this bottleneck with generative models to synthesize labels at …

Weak supervision as an efficient approach for automated seizure detection in electroencephalography

K Saab, J Dunnmon, C Ré, D Rubin… - NPJ digital …, 2020 - nature.com
Automated seizure detection from electroencephalography (EEG) would improve the quality
of patient care while reducing medical costs, but achieving reliably high performance across …

Scene graph prediction with limited labels

VS Chen, P Varma, R Krishna… - Proceedings of the …, 2019 - openaccess.thecvf.com
Visual knowledge bases such as Visual Genome power numerous applications in computer
vision, including visual question answering and captioning, but suffer from sparse …