Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
[HTML][HTML] Summary of chatgpt-related research and perspective towards the future of large language models
This paper presents a comprehensive survey of ChatGPT-related (GPT-3.5 and GPT-4)
research, state-of-the-art large language models (LLM) from the GPT series, and their …
research, state-of-the-art large language models (LLM) from the GPT series, and their …
Llm-based nlg evaluation: Current status and challenges
Evaluating natural language generation (NLG) is a vital but challenging problem in artificial
intelligence. Traditional evaluation metrics mainly capturing content (eg n-gram) overlap …
intelligence. Traditional evaluation metrics mainly capturing content (eg n-gram) overlap …
Palm 2 technical report
We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and
reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is …
reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is …
How good are gpt models at machine translation? a comprehensive evaluation
Generative Pre-trained Transformer (GPT) models have shown remarkable capabilities for
natural language generation, but their performance for machine translation has not been …
natural language generation, but their performance for machine translation has not been …
Towards making the most of chatgpt for machine translation
ChatGPT shows remarkable capabilities for machine translation (MT). Several prior studies
have shown that it achieves comparable results to commercial systems for high-resource …
have shown that it achieves comparable results to commercial systems for high-resource …
Large language models are state-of-the-art evaluators of translation quality
We describe GEMBA, a GPT-based metric for assessment of translation quality, which works
both with a reference translation and without. In our evaluation, we focus on zero-shot …
both with a reference translation and without. In our evaluation, we focus on zero-shot …
COMET-22: Unbabel-IST 2022 submission for the metrics shared task
In this paper, we present the joint contribution of Unbabel and IST to the WMT 2022 Metrics
Shared Task. Our primary submission–dubbed COMET-22–is an ensemble between a …
Shared Task. Our primary submission–dubbed COMET-22–is an ensemble between a …
xcomet: Transparent Machine Translation Evaluation through Fine-grained Error Detection
Widely used learned metrics for machine translation evaluation, such as Comet and Bleurt,
estimate the quality of a translation hypothesis by providing a single sentence-level score …
estimate the quality of a translation hypothesis by providing a single sentence-level score …
Error analysis prompting enables human-like translation evaluation in large language models
Generative large language models (LLMs), eg, ChatGPT, have demonstrated remarkable
proficiency across several NLP tasks, such as machine translation, text summarization …
proficiency across several NLP tasks, such as machine translation, text summarization …
Exploring human-like translation strategy with large language models
Large language models (LLMs) have demonstrated impressive capabilities in general
scenarios, exhibiting a level of aptitude that approaches, in some aspects even surpasses …
scenarios, exhibiting a level of aptitude that approaches, in some aspects even surpasses …