A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability

X Huang, D Kroening, W Ruan, J Sharp, Y Sun… - Computer Science …, 2020 - Elsevier
In the past few years, significant progress has been made on deep neural networks (DNNs)
in achieving human-level performance on several long-standing tasks. With the broader …

A survey of safety and trustworthiness of large language models through the lens of verification and validation

X Huang, W Ruan, W Huang, G **, Y Dong… - Artificial Intelligence …, 2024 - Springer
Large language models (LLMs) have exploded a new heatwave of AI for their ability to
engage end-users in human-level conversations with detailed and articulate answers across …

An overview of verification and validation challenges for inspection robots

M Fisher, RC Cardoso, EC Collins, C Dadswell… - Robotics, 2021 - mdpi.com
The advent of sophisticated robotics and AI technology makes sending humans into
hazardous and distant environments to carry out inspections increasingly avoidable. Being …

Metamorphic testing of deep learning compilers

D **ao, Z Liu, Y Yuan, Q Pang, S Wang - Proceedings of the ACM on …, 2022 - dl.acm.org
The prosperous trend of deploying deep neural network (DNN) models to diverse hardware
platforms has boosted the development of deep learning (DL) compilers. DL compilers take …

Mttm: Metamorphic testing for textual content moderation software

W Wang, J Huang, W Wu, J Zhang… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
The exponential growth of social media platforms such as Twitter and Facebook has
revolutionized textual communication and textual content publication in human society …

DeepState: selecting test suites to enhance the robustness of recurrent neural networks

Z Liu, Y Feng, Y Yin, Z Chen - … of the 44th International Conference on …, 2022 - dl.acm.org
Deep Neural Networks (DNN) have achieved tremendous success in various software
applications. However, accompanied by outstanding effectiveness, DNN-driven software …

DialTest: automated testing for recurrent-neural-network-driven dialogue systems

Z Liu, Y Feng, Z Chen - Proceedings of the 30th ACM SIGSOFT …, 2021 - dl.acm.org
With the tremendous advancement of recurrent neural networks (RNN), dialogue systems
have achieved significant development. Many RNN-driven dialogue systems, such as Siri …

AEON: a method for automatic evaluation of NLP test cases

J Huang, J Zhang, W Wang, P He, Y Su… - Proceedings of the 31st …, 2022 - dl.acm.org
Due to the labor-intensive nature of manual test oracle construction, various automated
testing techniques have been proposed to enhance the reliability of Natural Language …

Can Coverage Criteria Guide Failure Discovery for Image Classifiers? An Empirical Study

Z Wang, S Xu, L Fan, X Cai, L Li, Z Liu - ACM Transactions on Software …, 2024 - dl.acm.org
Quality assurance of deep neural networks (DNNs) is crucial for the deployment of DNN-
based software, especially in mission-and safety-critical tasks. Inspired by structural white …

Generalizing universal adversarial perturbations for deep neural networks

Y Zhang, W Ruan, F Wang, X Huang - Machine Learning, 2023 - Springer
Previous studies have shown that universal adversarial attacks can fool deep neural
networks over a large set of input images with a single human-invisible perturbation …