Agents in software engineering: Survey, landscape, and vision

Y Wang, W Zhong, Y Huang, E Shi, M Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
In recent years, Large Language Models (LLMs) have achieved remarkable success and
have been widely used in various downstream tasks, especially in the tasks of the software …

SECON: Maintaining semantic consistency in data augmentation for code search

X Zhang, Z Lin, X Hu, J Wang, W Lu… - ACM Transactions on …, 2025 - dl.acm.org
Efficient code search techniques are crucial in accelerating software development by aiding
developers in locating specific code snippets and understanding code functionalities. This …

Large language model ChatGPT versus small deep learning models for self‐admitted technical debt detection: Why not together?

J Li, L Li, J Liu, X Yu, X Liu… - Software: Practice and …, 2025 - Wiley Online Library
Given the increasing complexity and volume of Self‐Admitted Technical Debts (SATDs), how
to efficiently detect them becomes critical in software engineering practice for improving …

Prompt-based Code Completion via Multi-Retrieval Augmented Generation

H Tan, Q Luo, L Jiang, Z Zhan, J Li, H Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
Automated code completion, aiming at generating subsequent tokens from unfinished code,
has been significantly benefited from recent progress in pre-trained Large Language Models …

An empirical study of code search in intelligent coding assistant: Perceptions, expectations, and directions

C Liu, X Zhang, H Zhang, Z Wan, Z Huang… - … Proceedings of the 32nd …, 2024 - dl.acm.org
Code search plays an important role in enhancing the productivity of software developers.
Throughout the years, numerous code search tools have been developed and widely …

Hyperbolic code retrieval: a novel approach for efficient code search using hyperbolic space embeddings

X Tang, S Ezzini, H Tian, Y Song, J Klein… - arxiv preprint arxiv …, 2023 - arxiv.org
Within the realm of advanced code retrieval, existing methods have primarily relied on
intricate matching and attention-based mechanisms. However, these methods often lead to …

Improving Source Code Pre-training via Type-Specific Masking

W Zou, Q Li, C Li, J Ge, X Chen, LG Huang… - ACM Transactions on …, 2024 - dl.acm.org
The masked language modeling (MLM) task is widely recognized as one of the most
effective pre-training tasks and currently derives many variants in the software engineering …

Instructive Code Retriever: Learn from Large Language Model's Feedback for Code Intelligence Tasks

J Lu, H Wang, Z Liu, K Liang, L Bao… - Proceedings of the 39th …, 2024 - dl.acm.org
Recent studies proposed to leverage large language models (LLMs) with In-Context
Learning (ICL) to handle code intelligence tasks without fine-tuning. ICL employs task …

AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model

L Ma, W Yang, Y Li, B Fei, M Zhou, S Li, S Jiang… - arxiv preprint arxiv …, 2025 - arxiv.org
Automated log analysis is crucial to ensure high availability and reliability of complex
systems. The advent of LLMs in NLP has ushered in a new era of language model-driven …

Deep code search efficiency based on clustering

K Liu, J Liu, H Hu - Concurrency and Computation: Practice …, 2024 - Wiley Online Library
The deep‐learning based code search model mainly takes accuracy as the only target for
judging the performance of the model, ignoring the efficiency of code search. This article …