Software testing with large language models: Survey, landscape, and vision

J Wang, Y Huang, C Chen, Z Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Pre-trained large language models (LLMs) have recently emerged as a breakthrough
technology in natural language processing and artificial intelligence, with the ability to …

Deep learning-based software engineering: progress, challenges, and opportunities

X Chen, X Hu, Y Huang, H Jiang, W Ji, Y Jiang… - Science China …, 2025 - Springer
Researchers have recently achieved significant advances in deep learning techniques,
which in turn has substantially advanced other research disciplines, such as natural …

Large language models are zero-shot fuzzers: Fuzzing deep-learning libraries via large language models

Y Deng, CS **a, H Peng, C Yang, L Zhang - Proceedings of the 32nd …, 2023 - dl.acm.org
Deep Learning (DL) systems have received exponential growth in popularity and have
become ubiquitous in our everyday life. Such systems are built on top of popular DL …

Large language models are edge-case fuzzers: Testing deep learning libraries via fuzzgpt

Y Deng, CS **a, C Yang, SD Zhang, S Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
Deep Learning (DL) library bugs affect downstream DL applications, emphasizing the need
for reliable systems. Generating valid input programs for fuzzing DL libraries is challenging …

Large language models for compiler optimization

C Cummins, V Seeker, D Grubisic, M Elhoushi… - arxiv preprint arxiv …, 2023 - arxiv.org
We explore the novel application of Large Language Models to code optimization. We
present a 7B-parameter transformer model trained from scratch to optimize LLVM assembly …

Large language models are edge-case generators: Crafting unusual programs for fuzzing deep learning libraries

Y Deng, CS **a, C Yang, SD Zhang, S Yang… - Proceedings of the 46th …, 2024 - dl.acm.org
Bugs in Deep Learning (DL) libraries may affect almost all downstream DL applications, and
it is crucial to ensure the quality of such systems. It is challenging to generate valid input …

Fuzzing deep-learning libraries via automated relational api inference

Y Deng, C Yang, A Wei, L Zhang - Proceedings of the 30th ACM Joint …, 2022 - dl.acm.org
Deep Learning (DL) has gained wide attention in recent years. Meanwhile, bugs in DL
systems can lead to serious consequences, and may even threaten human lives. As a result …

Nnsmith: Generating diverse and valid test cases for deep learning compilers

J Liu, J Lin, F Ruffy, C Tan, J Li, A Panda… - Proceedings of the 28th …, 2023 - dl.acm.org
Deep-learning (DL) compilers such as TVM and TensorRT are increasingly being used to
optimize deep neural network (DNN) models to meet performance, resource utilization and …

Make llm a testing expert: Bringing human-like interaction to mobile gui testing via functionality-aware decisions

Z Liu, C Chen, J Wang, M Chen, B Wu, X Che… - Proceedings of the …, 2024 - dl.acm.org
Automated Graphical User Interface (GUI) testing plays a crucial role in ensuring app quality,
especially as mobile applications have become an integral part of our daily lives. Despite …

When software security meets large language models: A survey

X Zhu, W Zhou, QL Han, W Ma, S Wen… - IEEE/CAA Journal of …, 2025 - ieeexplore.ieee.org
Software security poses substantial risks to our society because software has become part of
our life. Numerous techniques have been proposed to resolve or mitigate the impact of …