Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …
capabilities with increasing scale. Despite their potentially transformative impact, these new …
A survey on recognizing textual entailment as an NLP evaluation
A Poliak - arxiv preprint arxiv:2010.03061, 2020 - arxiv.org
Recognizing Textual Entailment (RTE) was proposed as a unified evaluation framework to
compare semantic understanding of different NLP systems. In this survey paper, we provide …
compare semantic understanding of different NLP systems. In this survey paper, we provide …
An overview of temporal commonsense reasoning and acquisition
G Wenzel, A Jatowt - arxiv preprint arxiv:2308.00002, 2023 - arxiv.org
Temporal commonsense reasoning refers to the ability to understand the typical temporal
context of phrases, actions, and events, and use it to reason over problems requiring such …
context of phrases, actions, and events, and use it to reason over problems requiring such …
Test of time: Instilling video-language models with a sense of time
Modelling and understanding time remains a challenge in contemporary video
understanding models. With language emerging as a key driver towards powerful …
understanding models. With language emerging as a key driver towards powerful …
TIMEDIAL: Temporal commonsense reasoning in dialog
Everyday conversations require understanding everyday events, which in turn, requires
understanding temporal commonsense concepts interwoven with those events. Despite …
understanding temporal commonsense concepts interwoven with those events. Despite …
Temporal reasoning on implicit events from distant supervision
We propose TRACIE, a novel temporal reasoning dataset that evaluates the degree to which
systems understand implicit events--events that are not mentioned explicitly in natural …
systems understand implicit events--events that are not mentioned explicitly in natural …
Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
Objective Seizure frequency and seizure freedom are among the most important outcome
measures for patients with epilepsy. In this study, we aimed to automatically extract this …
measures for patients with epilepsy. In this study, we aimed to automatically extract this …
Doctime: A document-level temporal dependency graph parser
We introduce DocTime-a novel temporal dependency graph (TDG) parser that takes as input
a text document and produces a temporal dependency graph. It outperforms previous BERT …
a text document and produces a temporal dependency graph. It outperforms previous BERT …
A dataset for hyper-relational extraction and a cube-filling approach
Relation extraction has the potential for large-scale knowledge graph construction, but
current methods do not consider the qualifier attributes for each relation triplet, such as time …
current methods do not consider the qualifier attributes for each relation triplet, such as time …
Figurative language in recognizing textual entailment
We introduce a collection of recognizing textual entailment (RTE) datasets focused on
figurative language. We leverage five existing datasets annotated for a variety of figurative …
figurative language. We leverage five existing datasets annotated for a variety of figurative …