Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A systematic survey of text summarization: From statistical methods to large language models
Text summarization research has undergone several significant transformations with the
advent of deep neural networks, pre-trained language models (PLMs), and recent large …
advent of deep neural networks, pre-trained language models (PLMs), and recent large …
Minicheck: Efficient fact-checking of llms on grounding documents
Recognizing if LLM output can be grounded in evidence is central to many tasks in NLP:
retrieval-augmented generation, summarization, document-grounded dialogue, and more …
retrieval-augmented generation, summarization, document-grounded dialogue, and more …
FineSurE: Fine-grained summarization evaluation using LLMs
Automated evaluation is crucial for streamlining text summarization benchmarking and
model development, given the costly and time-consuming nature of human evaluation …
model development, given the costly and time-consuming nature of human evaluation …
[HTML][HTML] Factual consistency evaluation of summarization in the Era of large language models
Factual inconsistency with source documents in automatically generated summaries can
lead to misinformation or pose risks. Existing factual consistency (FC) metrics are …
lead to misinformation or pose risks. Existing factual consistency (FC) metrics are …
Tofueval: Evaluating hallucinations of llms on topic-focused dialogue summarization
Single document news summarization has seen substantial progress on faithfulness in
recent years, driven by research on the evaluation of factual consistency, or hallucinations …
recent years, driven by research on the evaluation of factual consistency, or hallucinations …
Instructing and prompting large language models for explainable cross-domain recommendations
In this paper, we present a strategy to provide users with explainable cross-domain
recommendations (CDR) that exploits large language models (LLMs). Generally speaking …
recommendations (CDR) that exploits large language models (LLMs). Generally speaking …
Checkeval: Robust evaluation framework using large language model via checklist
We introduce CheckEval, a novel evaluation framework using Large Language Models,
addressing the challenges of ambiguity and inconsistency in current evaluation methods …
addressing the challenges of ambiguity and inconsistency in current evaluation methods …
An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities
Code generation aims to automatically generate code snippets of specific programming
language according to natural language descriptions. The continuous advancements in …
language according to natural language descriptions. The continuous advancements in …
Genaudit: Fixing factual errors in language model outputs with evidence
LLMs can generate factually incorrect statements even when provided access to reference
documents. Such errors can be dangerous in high-stakes applications (eg, document …
documents. Such errors can be dangerous in high-stakes applications (eg, document …
Investigating hallucinations in pruned large language models for abstractive summarization
Despite the remarkable performance of generative large language models (LLMs) on
abstractive summarization, they face two significant challenges: their considerable size and …
abstractive summarization, they face two significant challenges: their considerable size and …