Explainable artificial intelligence for autonomous driving: A comprehensive overview and field guide for future research directions

S Atakishiyev, M Salameh, H Yao, R Goebel - IEEE Access, 2024 - ieeexplore.ieee.org
Autonomous driving has achieved significant milestones in research and development over
the last two decades. There is increasing interest in the field as the deployment of …

ChatGPT-like large-scale foundation models for prognostics and health management: A survey and roadmaps

YF Li, H Wang, M Sun - Reliability Engineering & System Safety, 2024 - Elsevier
PHM technology is vital in industrial production and maintenance, identifying and predicting
potential equipment failures and damages. This enables proactive maintenance measures …

OpenEQA: Embodied question answering in the era of foundation models

A Majumdar, A Ajay, X Zhang, P Putta… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present a modern formulation of Embodied Question Answering (EQA) as the task of
understanding an environment well enough to answer questions about it in natural …

Video graph transformer for video question answering

J Xiao, P Zhou, TS Chua, S Yan - European Conference on Computer …, 2022 - Springer
This paper proposes a Video Graph Transformer (VGT) model for Video Question Answering
(VideoQA). VGT's uniqueness is two-fold: 1) it designs a dynamic graph transformer …

Can I trust your answer? Visually grounded video question answering

J Xiao, A Yao, Y Li, TS Chua - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
We study visually grounded VideoQA in response to the emerging trends of utilizing
pretraining techniques for video-language understanding. Specifically by forcing vision …

IntentQA: Context-aware video intent reasoning

J Li, P Wei, W Han, L Fan - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
In this paper, we propose a novel task IntentQA, a special VideoQA task focusing on video
intent reasoning, which has become increasingly important for AI with its advantages in …

Are binary annotations sufficient? Video moment retrieval via hierarchical uncertainty-based active learning

W Ji, R Liang, Z Zheng, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent research on video moment retrieval has mostly focused on enhancing the
performance of accuracy, efficiency, and robustness, all of which largely rely on the …

Retrieving-to-answer: Zero-shot video question answering with frozen large language models

J Pan, Z Lin, Y Ge, X Zhu, R Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video Question Answering (VideoQA) has been significantly advanced from the
scaling of recent Large Language Models (LLMs). The key idea is to convert the visual …

Discovering spatio-temporal rationales for video question answering

Y Li, J Xiao, C Feng, X Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper strives to solve complex video question answering (VideoQA) which features
long videos containing multiple objects and events at different time. To tackle the challenge …

Contrastive video question answering via video graph transformer

J Xiao, P Zhou, A Yao, Y Li, R Hong… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
We propose to perform video question answering (VideoQA) in a Contrastive manner via a
Video Graph Transformer model (CoVGT). CoVGT's uniqueness and superiority are three …