- Academic Search

Y Zhang, B Colman, X Guo, A Shahriyari… - European Conference on …, 2024 - Springer

State-of-the-art deepfake detection approaches rely on image-based features extracted via
neural networks. While these approaches trained in a supervised manner extract likely fake …

Zapisz Cytuj Cytowane przez 16 Powiązane artykuły Wszystkie wersje 2

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Vision-and-language navigation today and tomorrow: A survey in the era of foundation models

Y Zhang, Z Ma, J Li, Y Qiao, Z Wang, J Chai… - ar** Language-Guided Navigation Learning with Self-Refining Data Flywheel

Z Wang, J Li, Y Hong, S Li, K Li, S Yu, Y Wang… - arxiv preprint arxiv …, 2024 - arxiv.org

Creating high-quality data for training robust language-instructed agents is a long-lasting
challenge in embodied AI. In this paper, we introduce a Self-Refining Data Flywheel (SRDF) …

Zapisz Cytuj Powiązane artykuły Wszystkie wersje 3 Wersja HTML

A Global-Memory-Aware Transformer for Vision-and-Language Navigation

L Wang, X Wu - 2024 5th International Seminar on Artificial …, 2024 - ieeexplore.ieee.org

Vision-and-Language Navigation (VLN) needs an agent to navigate in 3D environments
under the guidance of natural language instructions, continuously exploring until reaching a …

Zapisz Cytuj Powiązane artykuły

[Free GPT-4]
[DeepSeek]

[PDF] ceur-ws.org

[PDF][PDF] Neuro-symbolic Tuning for Multi-hop Reasoning over Spatial

T Premsri, P Kordjamshidi - 2024 - ceur-ws.org

Spatial Reasoning is a fundamental aspect of human cognition to perform everyday
activities. It is also an essential skill for machines to engage in human-like interactions with …

Zapisz Cytuj Powiązane artykuły Wszystkie wersje 2 Wersja HTML

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

Lovis: Learning orientation and visual signals for vision and language navigation

Common sense reasoning for deepfake detection

Vision-and-language navigation today and tomorrow: A survey in the era of foundation models

A Global-Memory-Aware Transformer for Vision-and-Language Navigation

[PDF][PDF] Neuro-symbolic Tuning for Multi-hop Reasoning over Spatial