Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing

HI Ashqar, A Jaber, TI Alhadidi, M Elhenawy - arxiv preprint arxiv …, 2024 - arxiv.org
This study aims to comprehensively review and empirically evaluate the application of
multimodal large language models (MLLMs) and Large Vision Models (VLMs) in object …

Visual Reasoning and Multi-Agent Approach in Multimodal Large Language Models (MLLMs): Solving TSP and mTSP Combinatorial Challenges

M Elhenawy, A Abutahoun, TI Alhadidi, A Jaber… - arxiv preprint arxiv …, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) harness comprehensive knowledge spanning
text, images, and audio to adeptly tackle complex problems, including zero-shot in-context …

[HTML][HTML] Leveraging Multimodal Large Language Models (MLLMs) for Enhanced Object Detection and Scene Understanding in Thermal Images for Autonomous …

HI Ashqar, TI Alhadidi, M Elhenawy, NO Khanfar - Automation, 2024 - mdpi.com
The integration of thermal imaging data with multimodal large language models (MLLMs)
offers promising advancements for enhancing the safety and functionality of autonomous …

Leveraging Large Language Models (LLMs) for Traffic Management at Urban Intersections: The Case of Mixed Traffic Scenarios

S Masri, HI Ashqar, M Elhenawy - arxiv preprint arxiv:2408.00948, 2024 - arxiv.org
Urban traffic management faces significant challenges due to the dynamic environments,
and traditional algorithms fail to quickly adapt to this environment in real-time and predict …

Leveraging Deep Learning and Multimodal Large Language Models for Near-Miss Detection Using Crowdsourced Videos

S Jaradat, M Elhenawy, HI Ashqar… - IEEE Open Journal of …, 2025 - ieeexplore.ieee.org
Near-miss traffic incidents, positioned just above" unsafe acts" on the safety triangle theory,
offer crucial predictive insights for preventing crashes. However, these incidents are often …