Data quality for software vulnerability datasets
The use of learning-based techniques to achieve automated software vulnerability detection
has been of longstanding interest within the software security domain. These data-driven …
has been of longstanding interest within the software security domain. These data-driven …
Automating code-related tasks through transformers: The impact of pre-training
Transformers have gained popularity in the software engineering (SE) literature. These deep
learning models are usually pre-trained through a self-supervised objective, meant to …
learning models are usually pre-trained through a self-supervised objective, meant to …
Vulnerabilities and Security Patches Detection in OSS: A Survey
R Lin, Y Fu, W Yi, J Yang, J Cao, Z Dong, F **e… - ACM Computing …, 2024 - dl.acm.org
Over the past decade, Open Source Software (OSS) has experienced rapid growth and
widespread adoption, attributed to its openness and editability. However, this expansion has …
widespread adoption, attributed to its openness and editability. However, this expansion has …
Invalidator: Automated patch correctness assessment via semantic and syntactic reasoning
Automated program repair (APR) faces the challenge of test overfitting, where generated
patches pass validation tests but fail to generalize. Existing methods for patch assessment …
patches pass validation tests but fail to generalize. Existing methods for patch assessment …
Enhancing security patch identification by capturing structures in commits
With the rapid increasing number of open source software (OSS), the majority of the software
vulnerabilities in the open source components are fixed silently, which leads to the deployed …
vulnerabilities in the open source components are fixed silently, which leads to the deployed …
Ccrep: Learning code change representations via pre-trained code model and query back
Representing code changes as numeric feature vectors, ie, code change representations, is
usually an essential step to automate many software engineering tasks related to code …
usually an essential step to automate many software engineering tasks related to code …
The devil is in the tails: How long-tailed code distributions impact large language models
Learning-based techniques, especially advanced Large Language Models (LLMs) for code,
have gained considerable popularity in various software engineering (SE) tasks. However …
have gained considerable popularity in various software engineering (SE) tasks. However …
Fine-grained commit-level vulnerability type prediction by CWE tree structure
Identifying security patches via code commits to allow early warnings and timely fixes for
Open Source Software (OSS) has received increasing attention. However, the existing …
Open Source Software (OSS) has received increasing attention. However, the existing …
Secureqwen: Leveraging llms for vulnerability detection in python codebases
Identifying vulnerabilities in software code is crucial for ensuring the security of modern
systems. However, manual detection requires expert knowledge and is time-consuming …
systems. However, manual detection requires expert knowledge and is time-consuming …
PatchFinder: A two-phase approach to security patch tracing for disclosed vulnerabilities in open-source software
Open-source software (OSS) vulnerabilities are increasingly prevalent, emphasizing the
importance of security patches. However, in widely used security platforms like NVD, a …
importance of security patches. However, in widely used security platforms like NVD, a …