Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Conceptmath: A bilingual concept-wise benchmark for measuring mathematical reasoning of large language models
This paper introduces ConceptMath, a bilingual (English and Chinese), fine-grained
benchmark that evaluates concept-wise mathematical reasoning of Large Language Models …
benchmark that evaluates concept-wise mathematical reasoning of Large Language Models …
Cfbench: A comprehensive constraints-following benchmark for llms
The adeptness of Large Language Models (LLMs) in comprehending and following natural
language instructions is critical for their deployment in sophisticated real-world applications …
language instructions is critical for their deployment in sophisticated real-world applications …
Survey of cultural awareness in language models: Text and beyond
Large-scale deployment of large language models (LLMs) in various applications, such as
chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure …
chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure …
WenMind: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Classical Literature and Language Arts
Abstract Large Language Models (LLMs) have made significant advancements across
numerous domains, but their capabilities in Chinese Classical Literature and Language Arts …
numerous domains, but their capabilities in Chinese Classical Literature and Language Arts …
CS4: Measuring the Creativity of Large Language Models Automatically by Controlling the Number of Story-Writing Constraints
Evaluating the creativity of large language models (LLMs) in story writing is difficult because
LLM-generated stories could seemingly look creative but be very similar to some existing …
LLM-generated stories could seemingly look creative but be very similar to some existing …
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization
In the realm of large language models (LLMs), the ability of models to accurately follow
instructions is paramount as more agents and applications leverage LLMs for construction …
instructions is paramount as more agents and applications leverage LLMs for construction …
Latent Learningscape Guided In-context Learning
The growing interest in leveraging large language models is driven by their exceptional
imitation and reasoning capabilities. In-context learning (ICL), a streamlined method, has …
imitation and reasoning capabilities. In-context learning (ICL), a streamlined method, has …