Deep learning based vulnerability detection: Are we there yet?

S Chakraborty, R Krishna, Y Ding… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Automated detection of software vulnerabilities is a fundamental problem in software
security. Existing program analysis techniques either suffer from high false positives or false …

Palm: Scaling language modeling with pathways

A Chowdhery, S Narang, J Devlin, M Bosma… - Journal of Machine …, 2023 - jmlr.org
Large language models have been shown to achieve remarkable performance across a
variety of natural language tasks using few-shot learning, which drastically reduces the …

Is your code generated by chatgpt really correct? rigorous evaluation of large language models for code generation

J Liu, CS **a, Y Wang, L Zhang - Advances in Neural …, 2024 - proceedings.neurips.cc
Program synthesis has been long studied with recent approaches focused on directly using
the power of Large Language Models (LLMs) to generate code. Programming benchmarks …

Unified pre-training for program understanding and generation

WU Ahmad, S Chakraborty, B Ray… - arxiv preprint arxiv …, 2021 - arxiv.org
Code summarization and generation empower conversion between programming language
(PL) and natural language (NL), while code translation avails the migration of legacy code …

Gorilla: Large language model connected with massive apis

SG Patil, T Zhang, X Wang, JE Gonzalez - arxiv preprint arxiv:2305.15334, 2023 - arxiv.org
Large Language Models (LLMs) have seen an impressive wave of advances recently, with
models now excelling in a variety of tasks, such as mathematical reasoning and program …

Codexglue: A machine learning benchmark dataset for code understanding and generation

S Lu, D Guo, S Ren, J Huang, A Svyatkovskiy… - arxiv preprint arxiv …, 2021 - arxiv.org
Benchmark datasets have a significant impact on accelerating research in programming
language tasks. In this paper, we introduce CodeXGLUE, a benchmark dataset to foster …

Code generation using machine learning: A systematic review

E Dehaerne, B Dey, S Halder, S De Gendt… - Ieee …, 2022 - ieeexplore.ieee.org
Recently, machine learning (ML) methods have been used to create powerful language
models for a broad range of natural language processing tasks. An important subset of this …

Measuring coding challenge competence with apps

D Hendrycks, S Basart, S Kadavath, M Mazeika… - arxiv preprint arxiv …, 2021 - arxiv.org
While programming is one of the most broadly applicable skills in modern society, modern
machine learning models still cannot code solutions to basic problems. Despite its …

Unsolved problems in ml safety

D Hendrycks, N Carlini, J Schulman… - arxiv preprint arxiv …, 2021 - arxiv.org
Machine learning (ML) systems are rapidly increasing in size, are acquiring new
capabilities, and are increasingly deployed in high-stakes settings. As with other powerful …

Codebleu: a method for automatic evaluation of code synthesis

S Ren, D Guo, S Lu, L Zhou, S Liu, D Tang… - arxiv preprint arxiv …, 2020 - arxiv.org
Evaluation metrics play a vital role in the growth of an area as it defines the standard of
distinguishing between good and bad models. In the area of code synthesis, the commonly …