- Academic Search

Learning from disagreement: A survey

AN Uma, T Fornaciari, D Hovy, S Paun, B Plank… - Journal of Artificial …, 2021 - jair.org

Many tasks in Natural Language Processing (NLP) and Computer Vision (CV) offer
evidence that humans disagree, from objective tasks such as part-of-speech tagging to more …

Cited by 204

Computational models of anaphora

M Poesio, J Yu, S Paun, A Aloraini, P Lu… - Annual Review of …, 2023 - annualreviews.org

Interpreting anaphoric references is a fundamental aspect of our language competence that
has long attracted the attention of computational linguists. The appearance of ever-larger …

Cited by 23

Inherent disagreements in human textual inferences

E Pavlick, T Kwiatkowski - Transactions of the Association for …, 2019 - direct.mit.edu

We analyze humans' disagreements about the validity of natural language inferences. We
show that, very often, disagreements are not dismissible as annotation “noise”, but rather …

Cited by 296

Investigating reasons for disagreement in natural language inference

NJ Jiang, MC Marneffe - Transactions of the Association for …, 2022 - direct.mit.edu

We investigate how disagreement in natural language inference (NLI) annotation arises. We
developed a taxonomy of disagreement sources with 10 categories spanning 3 high-level …

Cited by 53

We need to consider disagreement in evaluation

V Basile, M Fell, T Fornaciari, D Hovy, S Paun… - Proceedings of the 1st …, 2021 - iris.unito.it

Where have we been, and where are we going? It is easier to talk about the past than the
future. These days, benchmarks evolve more bottom up (such as papers with code). There …

Cited by 124

Quoref: A reading comprehension dataset with questions requiring coreferential reasoning

P Dasigi, NF Liu, A Marasović, NA Smith… - arXiv preprint arXiv …, 2019 - arxiv.org

Machine comprehension of texts longer than a single sentence often requires coreference
resolution. However, most current reading comprehension benchmarks do not contain …

Cited by 200

SemEval-2023 task 11: Learning with disagreements (LeWiDi)

E Leonardelli, A Uma, G Abercrombie… - arXiv preprint arXiv …, 2023 - arxiv.org

NLP datasets annotated with human judgments are rife with disagreements between the
judges. This is especially true for tasks depending on subjective judgments such as …

Cited by 55

An annotated dataset of coreference in English literature

D Bamman, O Lewke, A Mansoor - arXiv preprint arXiv:1912.01140, 2019 - arxiv.org

We present in this work a new dataset of coreference annotations for works of literature in
English, covering 29,103 mentions in 210,532 tokens from 100 works of fiction. This dataset …

Cited by 132

SemEval-2021 task 12: Learning with disagreements

A Uma, T Fornaciari, A Dumitrache… - Proceedings of the …, 2021 - aclanthology.org

Disagreement between coders is ubiquitous in virtually all datasets annotated with human
judgements in both natural language processing and computer vision. However, most …

Cited by 66

A case for soft loss functions

A Uma, T Fornaciari, D Hovy, S Paun, B Plank… - Proceedings of the …, 2020 - ojs.aaai.org

Recently, Peterson et al. provided evidence of the benefits of using probabilistic soft labels
generated from crowd annotations for training a computer vision model, showing that using …

Cited by 57

Identity, non-identity, and near-identity: Addressing the complexity of coreference

Learning from disagreement: A survey
