A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability
In the past few years, significant progress has been made on deep neural networks (DNNs)
in achieving human-level performance on several long-standing tasks. With the broader …
in achieving human-level performance on several long-standing tasks. With the broader …
A survey of safety and trustworthiness of large language models through the lens of verification and validation
Large language models (LLMs) have exploded a new heatwave of AI for their ability to
engage end-users in human-level conversations with detailed and articulate answers across …
engage end-users in human-level conversations with detailed and articulate answers across …
An overview of verification and validation challenges for inspection robots
The advent of sophisticated robotics and AI technology makes sending humans into
hazardous and distant environments to carry out inspections increasingly avoidable. Being …
hazardous and distant environments to carry out inspections increasingly avoidable. Being …
Metamorphic testing of deep learning compilers
The prosperous trend of deploying deep neural network (DNN) models to diverse hardware
platforms has boosted the development of deep learning (DL) compilers. DL compilers take …
platforms has boosted the development of deep learning (DL) compilers. DL compilers take …
Mttm: Metamorphic testing for textual content moderation software
The exponential growth of social media platforms such as Twitter and Facebook has
revolutionized textual communication and textual content publication in human society …
revolutionized textual communication and textual content publication in human society …
DeepState: selecting test suites to enhance the robustness of recurrent neural networks
Deep Neural Networks (DNN) have achieved tremendous success in various software
applications. However, accompanied by outstanding effectiveness, DNN-driven software …
applications. However, accompanied by outstanding effectiveness, DNN-driven software …
DialTest: automated testing for recurrent-neural-network-driven dialogue systems
With the tremendous advancement of recurrent neural networks (RNN), dialogue systems
have achieved significant development. Many RNN-driven dialogue systems, such as Siri …
have achieved significant development. Many RNN-driven dialogue systems, such as Siri …
AEON: a method for automatic evaluation of NLP test cases
Due to the labor-intensive nature of manual test oracle construction, various automated
testing techniques have been proposed to enhance the reliability of Natural Language …
testing techniques have been proposed to enhance the reliability of Natural Language …
Can Coverage Criteria Guide Failure Discovery for Image Classifiers? An Empirical Study
Quality assurance of deep neural networks (DNNs) is crucial for the deployment of DNN-
based software, especially in mission-and safety-critical tasks. Inspired by structural white …
based software, especially in mission-and safety-critical tasks. Inspired by structural white …
Generalizing universal adversarial perturbations for deep neural networks
Previous studies have shown that universal adversarial attacks can fool deep neural
networks over a large set of input images with a single human-invisible perturbation …
networks over a large set of input images with a single human-invisible perturbation …