Common sense reasoning for deepfake detection

Y Zhang, B Colman, X Guo, A Shahriyari… - European Conference on …, 2024 - Springer
State-of-the-art deepfake detection approaches rely on image-based features extracted via
neural networks. While these approaches trained in a supervised manner extract likely fake …

Vision-and-language navigation today and tomorrow: A survey in the era of foundation models

Y Zhang, Z Ma, J Li, Y Qiao, Z Wang, J Chai… - ar…

… Language-Guided Navigation Learning with Self-Refining Data Flywheel

Z Wang, J Li, Y Hong, S Li, K Li, S Yu, Y Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Creating high-quality data for training robust language-instructed agents is a long-lasting
challenge in embodied AI. In this paper, we introduce a Self-Refining Data Flywheel (SRDF) …

A Global-Memory-Aware Transformer for Vision-and-Language Navigation

L Wang, X Wu - 2024 5th International Seminar on Artificial …, 2024 - ieeexplore.ieee.org
Vision-and-Language Navigation (VLN) needs an agent to navigate in 3D environments
under the guidance of natural language instructions, continuously exploring until reaching a …

[PDF] Neuro-symbolic Tuning for Multi-hop Reasoning over Spatial

T Premsri, P Kordjamshidi - 2024 - ceur-ws.org
Spatial Reasoning is a fundamental aspect of human cognition to perform everyday
activities. It is also an essential skill for machines to engage in human-like interactions with …