Repairing the cracked foundation: A survey of obstacles in evaluation practices for generated text
Evaluation practices in natural language generation (NLG) have many known flaws,
but improved evaluation approaches are rarely widely adopted. This issue has become …
An integrative survey on mental health conversational agents to bridge computer science and medical perspectives
Mental health conversational agents (aka chatbots) are widely studied for their potential to
offer accessible support to those experiencing mental health challenges. Previous surveys …
AI transparency in the age of LLMs: A human-centered research roadmap
QV Liao, JW Vaughan - ar…
Stereotyping Norwegian salmon: An inventory of pitfalls in fairness benchmark datasets
Auditing NLP systems for computational harms like surfacing stereotypes is an elusive goal.
Several recent efforts have focused on benchmark datasets consisting of pairs of contrastive …
"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset
As language models grow in popularity, it becomes increasingly important to clearly
measure all possible markers of demographic identity in order to avoid perpetuating existing …
Evaluation of text generation: A survey
The paper surveys evaluation methods of natural language generation (NLG) systems that
have been developed in the last few years. We group NLG evaluation methods into three …
Measuring attribution in natural language generation models
Large neural models have brought a new challenge to natural language generation (NLG): It
has become imperative to ensure the safety and reliability of the output of models that …
Is GPT-3 text indistinguishable from human text? Scarecrow: A framework for scrutinizing machine text
Y Dou, M Forbes, R Koncel-Kedziorski… - arXiv preprint arXiv …, 2021 - arxiv.org
Modern neural language models can produce remarkably fluent and grammatical text. So
much, in fact, that recent work by Clark et al. (2021) has reported that conventional …
The perils of using Mechanical Turk to evaluate open-ended text generation
Recent text generation research has increasingly focused on open-ended domains such as
story and poetry generation. Because models built for such tasks are difficult to evaluate …