Semi-supervised learning by disagreement

ZH Zhou, M Li - Knowledge and Information Systems, 2010 - Springer
In many real-world tasks, there are abundant unlabeled examples but the number of labeled
training examples is limited, because labeling the examples requires human efforts and …

Causal inference in natural language processing: Estimation, prediction, interpretation and beyond

A Feder, KA Keith, E Manzoor, R Pryzant… - Transactions of the …, 2022 - direct.mit.edu
A fundamental goal of scientific research is to learn about causal relationships. However,
despite its critical role in the life and social sciences, causality has not had the same …

Semi-supervised data modeling and analytics in the process industry: Current research status and challenges

Z Ge - IFAC Journal of systems and control, 2021 - Elsevier
Semi-supervised data are quite common in the process industry, which has caught much
attention in recent years. The semi-supervised feature of process data not only has a great …

Tri-training: Exploiting unlabeled data using three classifiers

ZH Zhou, M Li - IEEE Transactions on knowledge and Data …, 2005 - ieeexplore.ieee.org
In many practical data mining applications, such as Web page classification, unlabeled
training examples are readily available, but labeled ones are fairly expensive to obtain …

[PDF][PDF] Effective self-training for parsing

D McClosky, E Charniak… - Proceedings of the human …, 2006 - aclanthology.org
We present a simple, but surprisingly effective, method of self-training a twophase parser-
reranker system using readily available unlabeled data. We show that this type of …

Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples

M Li, ZH Zhou - IEEE Transactions on Systems, Man, and …, 2007 - ieeexplore.ieee.org
In computer-aided diagnosis (CAD), machine learning techniques have been widely applied
to learn a hypothesis from diagnosed samples to assist the medical experts in making a …

Natural disasters detection in social media and satellite imagery: a survey

N Said, K Ahmad, M Riegler, K Pogorelov… - Multimedia Tools and …, 2019 - Springer
The analysis of natural disaster-related multimedia content got great attention in recent
years. Being one of the most important sources of information, social media have been …

[PDF][PDF] Semi-supervised regression with co-training.

ZH Zhou, M Li - IJCAI, 2005 - mit.bme.hu
In many practical machine learning and data mining applications, unlabeled training
examples are readily available but labeled ones are fairly expensive to obtain. Therefore …

Bootstrap** parsers via syntactic projection across parallel texts

R Hwa, P Resnik, A Weinberg, C Cabezas… - Natural language …, 2005 - cambridge.org
Broad coverage, high quality parsers are available for only a handful of languages. A
prerequisite for develo** broad coverage parsers for more languages is the annotation of …

Sample-based software defect prediction with active and semi-supervised learning

M Li, H Zhang, R Wu, ZH Zhou - Automated Software Engineering, 2012 - Springer
Software defect prediction can help us better understand and control software quality.
Current defect prediction techniques are mainly based on a sufficient amount of historical …