The stack: 3 tb of permissively licensed source code
Large Language Models (LLMs) play an ever-increasing role in the field of Artificial
Intelligence (AI)--not only for natural language processing but also for code understanding …
Intelligence (AI)--not only for natural language processing but also for code understanding …
SantaCoder: don't reach for the stars!
The BigCode project is an open-scientific collaboration working on the responsible
development of large language models for code. This tech report describes the progress of …
development of large language models for code. This tech report describes the progress of …
Octopack: Instruction tuning code large language models
Finetuning large language models (LLMs) on instructions leads to vast performance
improvements on natural language tasks. We apply instruction tuning using code …
improvements on natural language tasks. We apply instruction tuning using code …
MultiPL-E: a scalable and polyglot approach to benchmarking neural code generation
Large language models have demonstrated the ability to generate both natural language
and programming language text. Although contemporary code generation models are …
and programming language text. Although contemporary code generation models are …
Structured chain-of-thought prompting for code generation
Large Language Models (LLMs) have shown impressive abilities in code generation. Chain-
of-Thought (CoT) prompting is the state-of-the-art approach to utilizing LLMs. CoT prompting …
of-Thought (CoT) prompting is the state-of-the-art approach to utilizing LLMs. CoT prompting …
Exploring parameter-efficient fine-tuning techniques for code generation with large language models
Large language models (LLMs) demonstrate impressive capabilities to generate accurate
code snippets given natural language intents in a zero-shot manner, ie, without the need for …
code snippets given natural language intents in a zero-shot manner, ie, without the need for …
Crosscodeeval: A diverse and multilingual benchmark for cross-file code completion
Code completion models have made significant progress in recent years, yet current popular
evaluation datasets, such as HumanEval and MBPP, predominantly focus on code …
evaluation datasets, such as HumanEval and MBPP, predominantly focus on code …
Classeval: A manually-crafted benchmark for evaluating llms on class-level code generation
In this work, we make the first attempt to evaluate LLMs in a more challenging code
generation scenario, ie class-level code generation. We first manually construct the first …
generation scenario, ie class-level code generation. We first manually construct the first …
[PDF][PDF] Exploring the effectiveness of large language models in generating unit tests
A code generation model generates code by taking a prompt from a code comment, existing
code, or a combination of both. Although code generation models (eg, GitHub Copilot) are …
code, or a combination of both. Although code generation models (eg, GitHub Copilot) are …
A survey of large language models for code: Evolution, benchmarking, and future trends
General large language models (LLMs), represented by ChatGPT, have demonstrated
significant potential in tasks such as code generation in software engineering. This has led …
significant potential in tasks such as code generation in software engineering. This has led …