Language Integration in Remote Sensing: Tasks, datasets, and future directions

L Bashmal, Y Bazi, F Melgani… - … and Remote Sensing …, 2023 - ieeexplore.ieee.org
The emerging field of vision–language models, which combines computer vision and natural
language processing (NLP), has gained significant interest and exploration. This integration …

QViLa: Quantum Infused Vision-Language Model for Enhanced Multimodal Understanding

K Mukesh, SL Jayaprakash, RP Kumar - SN Computer Science, 2024 - Springer
Vision-language models have emerged as transformative tools, revolutionizing the
integration of visual and textual information, forging pathways for nuanced interpretations …

Dynamic task and weight prioritization curriculum learning for multimodal imagery

HF Alsan, T Arsan - arXiv preprint arXiv:2310.19109, 2023 - arxiv.org
This paper explores post-disaster analytics using multimodal deep learning models trained
with curriculum learning method. Studying post-disaster analytics is important as it plays a …

QuakeBERT: Accurate classification of social media texts for rapid earthquake impact assessment

J Han, Z Zheng, XZ Lu, KY Chen, JR Lin - arXiv preprint arXiv:2405.06684, 2024 - arxiv.org
Social media aids disaster response but suffers from noise, hindering accurate impact
assessment and decision making for resilient cities, which few studies considered. To …

Insights into aerial intelligence: assessing CNN-based algorithms for human action recognition and object detection in diverse environments

K Maheriya, M Rahevar, H Mewada, M Parmar… - Multimedia Tools and …, 2024 - Springer
Today's era follows a data-driven decision process for large-scale environment analysis.
Aerial view-based decision process plays a key role in various domains including …

Beyond Words: Exploring Co-Attention with BERT in Visual Question Answering

N Kamble, S Karamadi, S Varur… - 2024 5th International …, 2024 - ieeexplore.ieee.org
The Visual Question Answering project described in this work uses a multi-modal approach
that combines Bidirectional Encoder Representations from Transformers for complex natural …

Enhanced earthquake impact analysis based on social media texts via large language model

J Han, Z Zheng, XZ Lu, KY Chen, JR Lin - International Journal of Disaster …, 2024 - Elsevier
Social media aids disaster response but suffers from noise, hindering accurate impact
assessment and decision making for resilient cities, which few studies considered. To …

Development of an Object Detection and Tracking Pipeline for Unmanned Aerial Vehicles in a Simulated Environment

H Ayers - 2024 - search.proquest.com
Object tracking and recognition are essential tools for advanced autonomous navigation
systems, particularly in Unmanned Aerial Vehicles (UAVs). This project centers around the …