QA dataset explosion: A taxonomy of NLP resources for question answering and reading comprehension

A Rogers, M Gardner, I Augenstein - ACM Computing Surveys, 2023 - dl.acm.org
Alongside huge volumes of research on deep learning models in NLP in the recent years,
there has been much work on benchmark datasets needed to track modeling progress …

A survey of evaluation metrics used for NLG systems

AB Sai, AK Mohankumar, MM Khapra - ACM Computing Surveys (CSUR …, 2022 - dl.acm.org
In the last few years, a large number of automatic evaluation metrics have been proposed for
evaluating Natural Language Generation (NLG) systems. The rapid development and …

CoQA: A conversational question answering challenge

S Reddy, D Chen, CD Manning - Transactions of the Association for …, 2019 - direct.mit.edu
Humans gather information through conversations involving a series of interconnected
questions and answers. For machines to assist in information gathering, it is therefore …

LogiQA: A challenge dataset for machine reading comprehension with logical reasoning

J Liu, L Cui, H Liu, D Huang, Y Wang… - arXiv preprint arXiv …, 2020 - arxiv.org
Machine reading is a fundamental task for testing the capability of natural language
understanding, which is closely related to human cognition in many aspects. With the rising …

Unsupervised commonsense question answering with self-talk

V Shwartz, P West, RL Bras, C Bhagavatula… - arXiv preprint arXiv …, 2020 - arxiv.org
Natural language understanding involves reading between the lines with implicit
background knowledge. Current systems either rely on pre-trained language models as the …

A survey on empathetic dialogue systems

Y Ma, KL Nguyen, FZ Xing, E Cambria - Information Fusion, 2020 - Elsevier
Dialogue systems have achieved growing success in many areas thanks to the rapid
advances of machine learning techniques. In the quest for generating more human-like …

DREAM: A challenge data set and models for dialogue-based reading comprehension

K Sun, D Yu, J Chen, D Yu, Y Choi… - Transactions of the …, 2019 - direct.mit.edu
We present DREAM, the first dialogue-based multiple-choice reading comprehension data
set. Collected from English as a Foreign Language examinations designed by human …

" Going on a vacation" takes longer than" Going for a walk": A Study of Temporal Commonsense Understanding

B Zhou, D Khashabi, Q Ning, D Roth - arXiv preprint arXiv:1909.03065, 2019 - arxiv.org
Understanding time is crucial for understanding events expressed in natural language.
Because people rarely say the obvious, it is often necessary to have commonsense …

Enhancing pre-trained language representations with rich knowledge for machine reading comprehension

A Yang, Q Wang, J Liu, K Liu, Y Lyu, H Wu… - Proceedings of the …, 2019 - aclanthology.org
Machine reading comprehension (MRC) is a crucial and challenging task in NLP.
Recently, pre-trained language models (LMs), especially BERT, have achieved remarkable …

Commonsense knowledge base completion with structural and semantic context

C Malaviya, C Bhagavatula, A Bosselut… - Proceedings of the AAAI …, 2020 - aaai.org
Automatic KB completion for commonsense knowledge graphs (e.g., ATOMIC and
ConceptNet) poses unique challenges compared to the much studied conventional …