Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey of machine learning for big code and naturalness
Research at the intersection of machine learning, programming languages, and software
engineering has recently taken important steps in proposing learnable probabilistic models …
engineering has recently taken important steps in proposing learnable probabilistic models …
A survey on deep graph generation: Methods and applications
Graphs are ubiquitous in encoding relational information of real-world objects in many
domains. Graph generation, whose purpose is to generate new graphs from a distribution …
domains. Graph generation, whose purpose is to generate new graphs from a distribution …
Codexglue: A machine learning benchmark dataset for code understanding and generation
Benchmark datasets have a significant impact on accelerating research in programming
language tasks. In this paper, we introduce CodeXGLUE, a benchmark dataset to foster …
language tasks. In this paper, we introduce CodeXGLUE, a benchmark dataset to foster …
Longcoder: A long-range pre-trained language model for code completion
In this paper, we introduce a new task for code completion that focuses on handling long
code input and propose a sparse Transformer model, called LongCoder, to address this …
code input and propose a sparse Transformer model, called LongCoder, to address this …
Repobench: Benchmarking repository-level code auto-completion systems
Large Language Models (LLMs) have greatly advanced code auto-completion systems, with
a potential for substantial productivity enhancements for developers. However, current …
a potential for substantial productivity enhancements for developers. However, current …
No need to lift a finger anymore? assessing the quality of code generation by chatgpt
Large language models (LLMs) have demonstrated impressive capabilities across various
natural language processing (NLP) tasks, such as machine translation, question answering …
natural language processing (NLP) tasks, such as machine translation, question answering …
code2vec: Learning distributed representations of code
We present a neural model for representing snippets of code as continuous distributed
vectors (``code embeddings''). The main idea is to represent a code snippet as a single fixed …
vectors (``code embeddings''). The main idea is to represent a code snippet as a single fixed …
Is github's copilot as bad as humans at introducing vulnerabilities in code?
Several advances in deep learning have been successfully applied to the software
development process. Of recent interest is the use of neural language models to build tools …
development process. Of recent interest is the use of neural language models to build tools …
code2seq: Generating sequences from structured representations of code
The ability to generate natural language sequences from source code snippets has a variety
of applications such as code summarization, documentation, and retrieval. Sequence-to …
of applications such as code summarization, documentation, and retrieval. Sequence-to …
Unifying the perspectives of nlp and software engineering: A survey on language models for code
Z Zhang, C Chen, B Liu, C Liao, Z Gong, H Yu… - arxiv preprint arxiv …, 2023 - arxiv.org
In this work we systematically review the recent advancements in software engineering with
language models, covering 70+ models, 40+ evaluation tasks, 180+ datasets, and 900 …
language models, covering 70+ models, 40+ evaluation tasks, 180+ datasets, and 900 …