Dual-path rare content enhancement network for image and text matching

Y Wang, Y Su, W Li, J Xiao, X Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Image and text matching plays a crucial role in bridging the cross-modal gap between vision
and language, and has achieved great progress due to the deep learning. However, the …

OMCBIR: Offline mobile content-based image retrieval with lightweight CNN optimization

X Zhang, C Bai, K Kpalma - Displays, 2023 - Elsevier
Convolutional Neural Networks (CNNs) have achieved great success in computer
vision applications. However, due to the high requirements for computation power and …

Dual geometric perception for cross-domain road segmentation

W Zou, R Long, Y Zhang, M Liao, Z Zhou, S Tian - Displays, 2023 - Elsevier
Road segmentation plays an important role in navigation systems and autonomous driving.
However, many methods in road segmentation are based on supervised learning and suffer …

DHIQA: quality assessment of dehazed images based on attentive multi-scale feature fusion and rank learning

S Tian, T Zeng, W Zou, X Li, L Zhang - Displays, 2023 - Elsevier
Haze is a ubiquitous atmospheric phenomenon that seriously influences the visibility of
images. To this end, numerous image dehazing models have been proposed to improve the …

Hybrid attention network for image captioning

W Jiang, Q Li, K Zhan, Y Fang, F Shen - Displays, 2022 - Elsevier
Machine attention mechanisms are widely used in the task of image captioning.
Such mechanisms dynamically focus on different regions to guide the word generation …

ICEAP: An advanced fine-grained image captioning network with enhanced attribute predictor

MB Hossen, Z Ye, A Abdussalam, MA Hossain - Displays, 2024 - Elsevier
Fine-grained image captioning is a focal point in the vision-to-language task and has
attracted considerable attention for generating accurate and contextually relevant image …

Improving adversarial robustness of traffic sign image recognition networks

AS Hashemi, S Mozaffari, S Alirezaee - Displays, 2022 - Elsevier
The robustness of deep neural networks is an increasingly essential issue as they become
more and more prevalent in several real-world applications like autonomous vehicles. If …

Generative image inpainting with enhanced gated convolution and Transformers

M Wang, W Lu, J Lyu, K Shi, H Zhao - Displays, 2022 - Elsevier
Image inpainting is widely used to fill the damaged or masked area in an image with realistic
visual contents. However, most existing inpainting methods have limitations in …

Aligned visual semantic scene graph for image captioning

S Zhao, L Li, H Peng - Displays, 2022 - Elsevier
Image captioning is a multi-modal task to describe an image into natural language. Many
state-of-the-art methods generally take the encoder–decoder architecture, encode an image …

LRB-Net: Improving VQA via division of labor strategy and multimodal classifiers

J Feng, R Liu - Displays, 2022 - Elsevier
Visual question answering (VQA), along with multiple types of image and textual questions,
makes it a challenging task to infer the correct answer. Consequently, traditional methods …