Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Livecodebench: Holistic and contamination free evaluation of large language models for code
Large Language Models (LLMs) applied to code-related applications have emerged as a
prominent field, attracting significant interest from both academia and industry. However, as …
prominent field, attracting significant interest from both academia and industry. However, as …
Cruxeval: A benchmark for code reasoning, understanding and execution
We present CRUXEval (Code Reasoning, Understanding, and eXecution Evaluation), a
benchmark consisting of 800 Python functions (3-13 lines). Each function comes with an …
benchmark consisting of 800 Python functions (3-13 lines). Each function comes with an …
Transformers in source code generation: A comprehensive survey
Transformers have revolutionized natural language processing (NLP) and have had a huge
impact on automating tasks. Recently, transformers have led to the development of powerful …
impact on automating tasks. Recently, transformers have led to the development of powerful …
Multilingual training for software engineering
Well-trained machine-learning models, which leverage large amounts of open-source
software data, have now become an interesting approach to automating many software …
software data, have now become an interesting approach to automating many software …
A catalog of data smells for coding tasks
Large Language Models (LLMs) are increasingly becoming fundamental in supporting
software developers in coding tasks. The massive datasets used for training LLMs are often …
software developers in coding tasks. The massive datasets used for training LLMs are often …
Formal specifications from natural language
We study the generalization abilities of language models when translating natural language
into formal specifications with complex semantics. In particular, we fine-tune language …
into formal specifications with complex semantics. In particular, we fine-tune language …
Effibench: Benchmarking the efficiency of automatically generated code
Code generation models have increasingly become integral to aiding software
development. Although current research has thoroughly examined the correctness of the …
development. Although current research has thoroughly examined the correctness of the …
The counterfeit conundrum: Can code language models grasp the nuances of their incorrect generations?
While language models are increasingly more proficient at code generation, they still
frequently generate incorrect programs. Many of these programs are obviously wrong, but …
frequently generate incorrect programs. Many of these programs are obviously wrong, but …
Mhpp: Exploring the capabilities and limitations of language models beyond basic code generation
Recent advancements in large language models (LLMs) have greatly improved code
generation, specifically at the function level. For instance, GPT-4o has achieved a 91.0 …
generation, specifically at the function level. For instance, GPT-4o has achieved a 91.0 …
The vault: A comprehensive multilingual dataset for advancing code understanding and generation
We present The Vault, a dataset of high-quality code-text pairs in multiple programming
languages for training large language models to understand and generate code. We present …
languages for training large language models to understand and generate code. We present …