Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

A Srivastava, A Rastogi, A Rao, AAM Shoeb… - arxiv preprint arxiv …, 2022 - arxiv.org
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …

A survey on recognizing textual entailment as an NLP evaluation

A Poliak - arxiv preprint arxiv:2010.03061, 2020 - arxiv.org
Recognizing Textual Entailment (RTE) was proposed as a unified evaluation framework to
compare semantic understanding of different NLP systems. In this survey paper, we provide …

An overview of temporal commonsense reasoning and acquisition

G Wenzel, A Jatowt - arxiv preprint arxiv:2308.00002, 2023 - arxiv.org
Temporal commonsense reasoning refers to the ability to understand the typical temporal
context of phrases, actions, and events, and use it to reason over problems requiring such …

Test of time: Instilling video-language models with a sense of time

P Bagad, M Tapaswi… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Modelling and understanding time remains a challenge in contemporary video
understanding models. With language emerging as a key driver towards powerful …

TIMEDIAL: Temporal commonsense reasoning in dialog

L Qin, A Gupta, S Upadhyay, L He, Y Choi… - arxiv preprint arxiv …, 2021 - arxiv.org
Everyday conversations require understanding everyday events, which in turn, requires
understanding temporal commonsense concepts interwoven with those events. Despite …

Temporal reasoning on implicit events from distant supervision

B Zhou, K Richardson, Q Ning, T Khot… - arxiv preprint arxiv …, 2020 - arxiv.org
We propose TRACIE, a novel temporal reasoning dataset that evaluates the degree to which
systems understand implicit events--events that are not mentioned explicitly in natural …

Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing

K **e, RS Gallagher, EC Conrad… - Journal of the …, 2022 - academic.oup.com
Objective Seizure frequency and seizure freedom are among the most important outcome
measures for patients with epilepsy. In this study, we aimed to automatically extract this …

Doctime: A document-level temporal dependency graph parser

P Mathur, V Morariu, V Kaynig-Fittkau… - Proceedings of the …, 2022 - aclanthology.org
We introduce DocTime-a novel temporal dependency graph (TDG) parser that takes as input
a text document and produces a temporal dependency graph. It outperforms previous BERT …

A dataset for hyper-relational extraction and a cube-filling approach

YK Chia, L Bing, SM Aljunied, L Si, S Poria - arxiv preprint arxiv …, 2022 - arxiv.org
Relation extraction has the potential for large-scale knowledge graph construction, but
current methods do not consider the qualifier attributes for each relation triplet, such as time …

Figurative language in recognizing textual entailment

T Chakrabarty, D Ghosh, A Poliak… - arxiv preprint arxiv …, 2021 - arxiv.org
We introduce a collection of recognizing textual entailment (RTE) datasets focused on
figurative language. We leverage five existing datasets annotated for a variety of figurative …