Human-object interaction recognition for automatic construction site safety inspection

S Tang, D Roberts, M Golparvar-Fard - Automation in Construction, 2020 - Elsevier
Today, computer vision object detection methods are used for safety inspections from site
videos and images. These methods detect bounding boxes and use hand-made rules to …

A review on automatic image caption generation for various deep learning approaches

AP Singh, M Manoria, S Joshi - 2023 14th International …, 2023 - ieeexplore.ieee.org
In machine learning, image captioning is a process of extracting information as per the
action performed in an image. An image may contain various objects that may be associated …

Context-aware transformer for image captioning

X Yang, Y Wang, H Chen, J Li, T Huang - Neurocomputing, 2023 - Elsevier
Recently, image captioning models have made remarkable progress by introducing
transformer architecture, which utilizes self-attention to explore intra- and inter-modal …

Region-Focused Network for Dense Captioning

Q Huang, P Li, Y Huang, F Shuang, Y Cai - ACM Transactions on …, 2024 - dl.acm.org
Dense captioning is a very critical but under-explored task, which aims to densely detect
localized regions-of-interest (RoIs) and describe them with natural language in a given …

Dense video captioning using BiLSTM encoder

J Madake, S Bhatlawande, S Purandare… - 2022 3rd …, 2022 - ieeexplore.ieee.org
Video captioning has been a widely researched topic integrating visual information and
natural language but performing video captioning on long untrimmed videos is still …

Image retrieval using image captioning

N Vijayaraju - 2019 - scholarworks.sjsu.edu
The rapid growth in the availability of the Internet and smartphones has resulted in the
increase in usage of social media in recent years. This increased usage has thereby …

Relevant Visual Semantic Context-Aware Attention-Based Dialog

ETB Hong, YW Chong, TC Wan… - Computers, Materials & …, 2023 - search.ebscohost.com
The existing dataset for visual dialog comprises multiple rounds of questions and a diverse
range of image contents. However, it faces challenges in overcoming visual semantic …

Image caption generation using deep residual learning

R Jain, A Jhapate, M Saxena - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
The process of creating a written description of an image that describes the action depicted
in it is known as image captioning. It is one of the most challenging study areas, and the only …

Survey of Semantic-Based Image-to-Image Retrieval

D Cao, H Zhou, H Yang - 2024 5th International Conference on …, 2024 - ieeexplore.ieee.org
Image-to-image retrieval has been a focal point of research in both academia and industry
over the past few decades, finding applications across various fields. However, with …

Automatic image caption generation: A review

V Verma, SK Saritha, S Jain - AIP Conference Proceedings, 2023 - pubs.aip.org
Image caption is a very popular approach through which descriptive language can be
generated in natural form. It is a challenging task in the field of Artificial Intelligence where …