Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
SESCORE2: Learning text generation evaluation via synthesizing realistic mistakes
Is it possible to train a general metric for evaluating text generation quality without human
annotated ratings? Existing learned metrics either perform unsatisfactorily across text …
annotated ratings? Existing learned metrics either perform unsatisfactorily across text …
Multilingual conceptual coverage in text-to-image models
We propose" Conceptual Coverage Across Languages"(CoCo-CroLa), a technique for
benchmarking the degree to which any generative text-to-image system provides …
benchmarking the degree to which any generative text-to-image system provides …
A review of faithfulness metrics for hallucination assessment in Large Language Models
B Malin, T Kalganova, N Boulgouris - arxiv preprint arxiv:2501.00269, 2024 - arxiv.org
This review examines the means with which faithfulness has been evaluated across open-
ended summarization, question-answering and machine translation tasks. We find that the …
ended summarization, question-answering and machine translation tasks. We find that the …
Towards fine-grained information: Identifying the type and location of translation errors
Fine-grained information on translation errors is helpful for the translation evaluation
community. Existing approaches can not synchronously consider error position and type …
community. Existing approaches can not synchronously consider error position and type …
Translation Canvas: An Explainable Interface to Pinpoint and Analyze Translation Systems
With the rapid advancement of machine translation research, evaluation toolkits have
become essential for benchmarking system progress. Tools like COMET and SacreBLEU …
become essential for benchmarking system progress. Tools like COMET and SacreBLEU …
Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models
Generative large language models (LLMs), eg, ChatGPT, have demonstrated remarkable
proficiency across several NLP tasks, such as machine translation, text summarization …
proficiency across several NLP tasks, such as machine translation, text summarization …