Google Učenjak

Članki

Učenjak

Približno 443 rez. (0,04 s)

Dynabench: Rethinking benchmarking in NLP

D Kiela, M Bartolo, Y Nie, D Kaushik, A Geiger… - arxiv preprint arxiv …, 2021 - arxiv.org

We introduce Dynabench, an open-source platform for dynamic dataset creation and model
benchmarking. Dynabench runs in a web browser and supports human-and-model-in-the …

Shrani Navedi Navedeno v 437 virih Sorodni članki Vse različice: 8 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Klue: Korean language understanding evaluation

S Park, J Moon, S Kim, WI Cho, J Han, J Park… - arxiv preprint arxiv …, 2021 - arxiv.org

We introduce Korean Language Understanding Evaluation (KLUE) benchmark. KLUE is a
collection of 8 Korean natural language understanding (NLU) tasks, including Topic …

Shrani Navedi Navedeno v 316 virih Sorodni članki Vse različice: 7 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Bert-attack: Adversarial attack against bert using bert

L Li, R Ma, Q Guo, X Xue, X Qiu - arxiv preprint arxiv:2004.09984, 2020 - arxiv.org

Adversarial attacks for discrete data (such as texts) have been proved significantly more
challenging than continuous data (such as images) since it is difficult to generate adversarial …

Shrani Navedi Navedeno v 729 virih Sorodni članki Vse različice: 7 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Adversarial NLI: A new benchmark for natural language understanding

Y Nie, A Williams, E Dinan, M Bansal, J Weston… - arxiv preprint arxiv …, 2019 - arxiv.org

We introduce a new large-scale NLI benchmark dataset, collected via an iterative,
adversarial human-and-model-in-the-loop procedure. We show that training models on this …

Shrani Navedi Navedeno v 1029 virih Sorodni članki Vse različice: 9 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Hellaswag: Can a machine really finish your sentence?

R Zellers, A Holtzman, Y Bisk, A Farhadi… - arxiv preprint arxiv …, 2019 - arxiv.org

Recent work by Zellers et al.(2018) introduced a new task of commonsense natural
language inference: given an event description such as" A woman sits at a piano," a …

Shrani Navedi Navedeno v 2049 virih Sorodni članki Vse različice: 4 V obliki HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Boolq: Exploring the surprising difficulty of natural yes/no questions

C Clark, K Lee, MW Chang, T Kwiatkowski… - arxiv preprint arxiv …, 2019 - arxiv.org

In this paper we study yes/no questions that are naturally occurring---meaning that they are
generated in unprompted and unconstrained settings. We build a reading comprehension …

Shrani Navedi Navedeno v 1363 virih Sorodni članki Vse različice: 6 V obliki HTML

Navedi

Napredno iskanje

Shranjeno v Mojo knjižnico

Dynabench: Rethinking benchmarking in NLP

Klue: Korean language understanding evaluation

Bert-attack: Adversarial attack against bert using bert

Adversarial NLI: A new benchmark for natural language understanding

Hellaswag: Can a machine really finish your sentence?

Boolq: Exploring the surprising difficulty of natural yes/no questions