Vision-language models for medical report generation and visual question answering: A review

I Hartsock, G Rasool - Frontiers in Artificial Intelligence, 2024 - frontiersin.org
Medical vision-language models (VLMs) combine computer vision (CV) and natural
language processing (NLP) to analyze visual and textual medical data. Our paper reviews …

Cross-city matters: A multimodal remote sensing benchmark dataset for cross-city semantic segmentation using high-resolution domain adaptation networks

D Hong, B Zhang, H Li, Y Li, J Yao, C Li… - Remote Sensing of …, 2023 - Elsevier
Artificial intelligence (AI) approaches nowadays have gained remarkable success in single-
modality-dominated remote sensing (RS) applications, especially with an emphasis on …

UIU-Net: U-Net in U-Net for infrared small object detection

X Wu, D Hong, J Chanussot - IEEE Transactions on Image …, 2022 - ieeexplore.ieee.org
Learning-based infrared small object detection methods currently rely heavily on the
classification backbone network. This tends to result in tiny object loss and feature …

Bi-directional cross-modality feature propagation with separation-and-aggregation gate for RGB-D semantic segmentation

X Chen, KY Lin, J Wang, W Wu, C Qian, H Li… - European conference on …, 2020 - Springer
Depth information has proven to be a useful cue in the semantic segmentation of RGB-D
images for providing a geometric counterpart to the RGB representation. Most existing works …

Multiattention network for semantic segmentation of fine-resolution remote sensing images

R Li, S Zheng, C Zhang, C Duan, J Su… - … on Geoscience and …, 2021 - ieeexplore.ieee.org
Semantic segmentation of remote sensing images plays an important role in a wide range of
applications, including land resource management, biosphere monitoring, and urban …

CPFNet: Context pyramid fusion network for medical image segmentation

S Feng, H Zhao, F Shi, X Cheng… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Accurate and automatic segmentation of medical images is a crucial step for clinical
diagnosis and analysis. The convolutional neural network (CNN) approaches based on the …

CTNet: Context-based tandem network for semantic segmentation

Z Li, Y Sun, L Zhang, J Tang - IEEE Transactions on Pattern …, 2021 - ieeexplore.ieee.org
Contextual information has been shown to be powerful for semantic segmentation. This work
proposes a novel Context-based Tandem Network (CTNet) by interactively exploring the …

A survey on deep learning based approaches for scene understanding in autonomous driving

Z Guo, Y Huang, X Hu, H Wei, B Zhao - Electronics, 2021 - mdpi.com
As a prerequisite for autonomous driving, scene understanding has attracted extensive
research. With the rise of the convolutional neural network (CNN)-based deep learning …

Deep learning for understanding satellite imagery: An experimental survey

SP Mohanty, J Czakon, KA Kaczmarek… - Frontiers in Artificial …, 2020 - frontiersin.org
Translating satellite imagery into maps requires intensive effort and time, especially leading
to inaccurate maps of the affected regions during disaster and conflict. The combination of …

Unsupervised domain adaptation for medical image segmentation by disentanglement learning and self-training

Q **e, Y Li, N He, M Ning, K Ma, G Wang… - … on Medical Imaging, 2022 - ieeexplore.ieee.org
Unsupervised domain adaption (UDA), which aims to enhance the segmentation
performance of deep models on unlabeled data, has recently drawn much attention. In this …