Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey on evaluation of large language models
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …
industry, owing to their unprecedented performance in various applications. As LLMs …
[PDF][PDF] Large language models: a comprehensive survey of its applications, challenges, limitations, and future prospects
Within the vast expanse of computerized language processing, a revolutionary entity known
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …
[PDF][PDF] A survey of large language models
Ever since the Turing Test was proposed in the 1950s, humans have explored the mastering
of language intelligence by machine. Language is essentially a complex, intricate system of …
of language intelligence by machine. Language is essentially a complex, intricate system of …
[PDF][PDF] Agentverse: Facilitating multi-agent collaboration and exploring emergent behaviors in agents
Autonomous agents empowered by Large Language Models (LLMs) have undergone
significant improvements, enabling them to generalize across a broad spectrum of tasks …
significant improvements, enabling them to generalize across a broad spectrum of tasks …
Evaluating large language models at evaluating instruction following
As research in large language models (LLMs) continues to accelerate, LLM-based
evaluation has emerged as a scalable and cost-effective alternative to human evaluations …
evaluation has emerged as a scalable and cost-effective alternative to human evaluations …
Datasets for large language models: A comprehensive survey
This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …
Generative judge for evaluating alignment
The rapid development of Large Language Models (LLMs) has substantially expanded the
range of tasks they can address. In the field of Natural Language Processing (NLP) …
range of tasks they can address. In the field of Natural Language Processing (NLP) …
Llm-based nlg evaluation: Current status and challenges
Evaluating natural language generation (NLG) is a vital but challenging problem in artificial
intelligence. Traditional evaluation metrics mainly capturing content (eg n-gram) overlap …
intelligence. Traditional evaluation metrics mainly capturing content (eg n-gram) overlap …
Universal self-consistency for large language model generation
Self-consistency with chain-of-thought prompting (CoT) has demonstrated remarkable
performance gains on various challenging tasks, by utilizing multiple reasoning paths …
performance gains on various challenging tasks, by utilizing multiple reasoning paths …
Branch-solve-merge improves large language model evaluation and generation
Large Language Models (LLMs) are frequently used for multi-faceted language generation
and evaluation tasks that involve satisfying intricate user constraints or taking into account …
and evaluation tasks that involve satisfying intricate user constraints or taking into account …