A survey on large language models for software engineering

Q Zhang, C Fang, Y **e, Y Zhang, Y Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
Software Engineering (SE) is the systematic design, development, maintenance, and
management of software applications underpinning the digital infrastructure of our modern …

Rethinking membership inference attacks against transfer learning

C Wu, J Chen, Q Fang, K He, Z Zhao… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Transfer learning, successful in knowledge translation across related tasks, faces a
substantial privacy threat from membership inference attacks (MIAs). These attacks, despite …

It's All in the Touch: Authenticating Users with HOST Gestures on Multi-Touch Screen Devices

C Wu, H Cao, G Xu, C Zhou, J Sun… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
As smartphones proliferate, secure and user-friendly authentication methods are
increasingly critical. Existing behavioral biometrics, however, are often compromised by …

Source code summarization in the era of large language models

W Sun, Y Miao, Y Li, H Zhang, C Fang, Y Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
To support software developers in understanding and maintaining programs, various
automatic (source) code summarization techniques have been proposed to generate a …

Vulseye: Detect smart contract vulnerabilities via stateful directed graybox fuzzing

R Liang, J Chen, C Wu, K He, Y Wu… - IEEE Transactions …, 2025 - ieeexplore.ieee.org
Smart contracts, the cornerstone of decentralized applications, have become increasingly
prominent in revolutionizing the digital landscape. However, vulnerabilities in smart …

A catalog of data smells for coding tasks

A Vitale, R Oliveto, S Scalabrino - ACM Transactions on Software …, 2024 - dl.acm.org
Large Language Models (LLMs) are increasingly becoming fundamental in supporting
software developers in coding tasks. The massive datasets used for training LLMs are often …

Promises and perils of using Transformer-based models for SE research

Y **ao, X Zuo, X Lu, JS Dong, X Cao, I Beschastnikh - Neural Networks, 2024 - Elsevier
Many Transformer-based pre-trained models for code have been developed and applied to
code-related tasks. In this paper, we analyze 519 papers published on this topic during 2017 …

Towards cost-efficient vulnerability detection with cross-modal adversarial reprogramming

Z Tian, R Qiu, Y Teng, J Sun, Y Chen, L Chen - Journal of Systems and …, 2025 - Elsevier
While deep learning has advanced the automatic detection of software vulnerabilities,
current DL-based methods still face two major obstacles: the scarcity of vulnerable code …

Resource-Efficient & Effective Code Summarization

S Afrin, J Call, KN Nguyen, O Chaparro… - arxiv preprint arxiv …, 2025 - arxiv.org
Code Language Models (CLMs) have demonstrated high effectiveness in automating
software engineering tasks such as bug fixing, code generation, and code documentation …

Beyond Dataset Watermarking: Model-Level Copyright Protection for Code Summarization Models

J Zhang, H Li, D Wu, X Sun, Q Lu, G Long - arxiv preprint arxiv …, 2024 - arxiv.org
Code Summarization Model (CSM) has been widely used in code production, such as
online and web programming for PHP and Javascript. CSMs are essential tools in code …