Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Pre-trained language models in medicine: A survey
With the rapid progress in Natural Language Processing (NLP), Pre-trained Language
Models (PLM) such as BERT, BioBERT, and ChatGPT have shown great potential in various …
Models (PLM) such as BERT, BioBERT, and ChatGPT have shown great potential in various …
COMET-22: Unbabel-IST 2022 submission for the metrics shared task
In this paper, we present the joint contribution of Unbabel and IST to the WMT 2022 Metrics
Shared Task. Our primary submission–dubbed COMET-22–is an ensemble between a …
Shared Task. Our primary submission–dubbed COMET-22–is an ensemble between a …
xcomet: Transparent Machine Translation Evaluation through Fine-grained Error Detection
Widely used learned metrics for machine translation evaluation, such as Comet and Bleurt,
estimate the quality of a translation hypothesis by providing a single sentence-level score …
estimate the quality of a translation hypothesis by providing a single sentence-level score …
Error analysis prompting enables human-like translation evaluation in large language models
Generative large language models (LLMs), eg, ChatGPT, have demonstrated remarkable
proficiency across several NLP tasks, such as machine translation, text summarization …
proficiency across several NLP tasks, such as machine translation, text summarization …
System combination via quality estimation for grammatical error correction
Quality estimation models have been developed to assess the corrections made by
grammatical error correction (GEC) models when the reference or gold-standard corrections …
grammatical error correction (GEC) models when the reference or gold-standard corrections …
Instructscore: Explainable text generation evaluation with finegrained feedback
Automatically evaluating the quality of language generation is critical. Although recent
learned metrics show high correlation with human judgement, these metrics can not explain …
learned metrics show high correlation with human judgement, these metrics can not explain …
The devil is in the errors: Leveraging large language models for fine-grained machine translation evaluation
Automatic evaluation of machine translation (MT) is a critical tool driving the rapid iterative
development of MT systems. While considerable progress has been made on estimating a …
development of MT systems. While considerable progress has been made on estimating a …
Understanding and detecting hallucinations in neural machine translation via model introspection
Neural sequence generation models are known to “hallucinate”, by producing outputs that
are unrelated to the source text. These hallucinations are potentially harmful, yet it remains …
are unrelated to the source text. These hallucinations are potentially harmful, yet it remains …
Efficient benchmarking of language models
The increasing versatility of language models (LMs) has given rise to a new class of
benchmarks that comprehensively assess a broad range of capabilities. Such benchmarks …
benchmarks that comprehensively assess a broad range of capabilities. Such benchmarks …
Towards making the most of llm for translation quality estimation
Abstract Machine Translation Quality Estimation (QE) aims to evaluate the quality of
machine translation without relying on references. Recently, Large-scale Language Model …
machine translation without relying on references. Recently, Large-scale Language Model …