Surveying neuro-symbolic approaches for reliable artificial intelligence of things

Z Lu, I Afridi, HJ Kang, I Ruchkin, X Zheng - Journal of Reliable Intelligent …, 2024 - Springer
Abstract The integration of Artificial Intelligence (AI) with the Internet of Things (IoT), known
as the Artificial Intelligence of Things (AIoT), enhances the devices' processing and analysis …

A & b== b & a: Triggering logical reasoning failures in large language models

Y Wan, W Wang, Y Yang, Y Yuan, J Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advancements in large language models (LLMs) have propelled Artificial
Intelligence (AI) to new heights, enabling breakthroughs in various tasks such as writing …

Marmot: Metamorphic runtime monitoring of autonomous driving systems

J Ayerdi, A Iriarte, P Valle, I Roman… - ACM Transactions on …, 2024 - dl.acm.org
Autonomous driving systems (ADSs) are complex cyber-physical systems (CPSs) that must
ensure safety even in uncertain conditions. Modern ADSs often employ deep neural …

Met-mapf: A metamorphic testing approach for multi-agent path finding algorithms

XY Zhang, Y Liu, P Arcaini, M Jiang… - ACM Transactions on …, 2024 - dl.acm.org
The Multi-Agent Path Finding (MAPF) problem, ie, the scheduling of multiple agents to reach
their destinations, has been widely investigated. Testing MAPF systems is challenging, due …

Identifying the Failure-Revealing Test Cases in Metamorphic Testing: A Statistical Approach

Z Zheng, D Ren, H Liu, TY Chen, T Li - ACM Transactions on Software …, 2024 - dl.acm.org
Metamorphic testing, thanks to its high failure-detection effectiveness especially in the
absence of test oracle, has been widely applied in both the traditional context of software …

Metamorphic runtime monitoring of autonomous driving systems

J Ayerdi, A Iriarte, P Valle, I Roman… - arxiv preprint arxiv …, 2023 - arxiv.org
Autonomous Driving Systems (ADSs) are complex Cyber-Physical Systems (CPSs) that
must ensure safety even in uncertain conditions. Modern ADSs often employ Deep Neural …

The earth is flat? unveiling factual errors in large language models

W Wang, J Shi, Z Tu, Y Yuan, J Huang, W Jiao… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) like ChatGPT are foundational in various applications due
to their extensive knowledge from pre-training and fine-tuning. Despite this, they are prone …

Metamorphic Relation Generation: State of the Art and Visions for Future Research

R Li, H Liu, PL Poon, D Towey, CA Sun… - arxiv preprint arxiv …, 2024 - arxiv.org
Metamorphic testing has become one mainstream technique to address the notorious oracle
problem in software testing, thanks to its great successes in revealing real-life bugs in a wide …

MetaSem: metamorphic testing based on semantic information of autonomous driving scenes

Z Yang, S Huang, T Bai, Y Yao, Y Wang… - Software Testing …, 2024 - Wiley Online Library
The development of artificial intelligence and information communication technology has
significantly propelled advancements in autonomous driving. The advent of autonomous …

Effectiveness of symmetric metamorphic relations on validating the stability of code generation LLM

PYP Chan, J Keung, Z Yang - Journal of Systems and Software, 2025 - Elsevier
Pre-trained large language models (LLMs) are increasingly used in software development
for code generation, with a preference for private LLMs over public ones to avoid the risk of …