Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Teach me to explain: A review of datasets for explainable natural language processing
S Wiegreffe, A Marasović - arxiv preprint arxiv:2102.12060, 2021 - arxiv.org
Explainable NLP (ExNLP) has increasingly focused on collecting human-annotated textual
explanations. These explanations are used downstream in three ways: as data …
explanations. These explanations are used downstream in three ways: as data …
The belebele benchmark: a parallel reading comprehension dataset in 122 language variants
We present Belebele, a multiple-choice machine reading comprehension (MRC) dataset
spanning 122 language variants. Significantly expanding the language coverage of natural …
spanning 122 language variants. Significantly expanding the language coverage of natural …
What will it take to fix benchmarking in natural language understanding?
Evaluation for many natural language understanding (NLU) tasks is broken: Unreliable and
biased systems score so highly on standard benchmarks that there is little room for …
biased systems score so highly on standard benchmarks that there is little room for …
WANLI: Worker and AI collaboration for natural language inference dataset creation
A recurring challenge of crowdsourcing NLP datasets at scale is that human writers often
rely on repetitive patterns when crafting examples, leading to a lack of linguistic diversity. We …
rely on repetitive patterns when crafting examples, leading to a lack of linguistic diversity. We …
IMPLI: Investigating NLI models' performance on figurative language
Natural language inference (NLI) has been widely used as a task to train and evaluate
models for language understanding. However, the ability of NLI models to perform …
models for language understanding. However, the ability of NLI models to perform …
Issues with entailment-based zero-shot text classification
The general format of natural language inference (NLI) makes it tempting to be used for zero-
shot text classification by casting any target label into a sentence of hypothesis and verifying …
shot text classification by casting any target label into a sentence of hypothesis and verifying …
Models in the loop: Aiding crowdworkers with generative annotation assistants
In Dynamic Adversarial Data Collection (DADC), human annotators are tasked with finding
examples that models struggle to predict correctly. Models trained on DADC-collected …
examples that models struggle to predict correctly. Models trained on DADC-collected …
Analyzing dynamic adversarial training data in the limit
To create models that are robust across a wide range of test inputs, training datasets should
include diverse examples that span numerous phenomena. Dynamic adversarial data …
include diverse examples that span numerous phenomena. Dynamic adversarial data …
Fool me twice: Entailment from wikipedia gamification
We release FoolMeTwice (FM2 for short), a large dataset of challenging entailment pairs
collected through a fun multi-player game. Gamification encourages adversarial examples …
collected through a fun multi-player game. Gamification encourages adversarial examples …
Farstail: A persian natural language inference dataset
With the considerable achievements of data-hungry deep learning methods in natural
language processing tasks, a great amount of effort has been devoted to develop more …
language processing tasks, a great amount of effort has been devoted to develop more …