Vision-language models for medical report generation and visual question answering: A review
Medical vision-language models (VLMs) combine computer vision (CV) and natural
language processing (NLP) to analyze visual and textual medical data. Our paper reviews …
language processing (NLP) to analyze visual and textual medical data. Our paper reviews …
Cross-city matters: A multimodal remote sensing benchmark dataset for cross-city semantic segmentation using high-resolution domain adaptation networks
Artificial intelligence (AI) approaches nowadays have gained remarkable success in single-
modality-dominated remote sensing (RS) applications, especially with an emphasis on …
modality-dominated remote sensing (RS) applications, especially with an emphasis on …
UIU-Net: U-Net in U-Net for infrared small object detection
Learning-based infrared small object detection methods currently rely heavily on the
classification backbone network. This tends to result in tiny object loss and feature …
classification backbone network. This tends to result in tiny object loss and feature …
Bi-directional cross-modality feature propagation with separation-and-aggregation gate for RGB-D semantic segmentation
Depth information has proven to be a useful cue in the semantic segmentation of RGB-D
images for providing a geometric counterpart to the RGB representation. Most existing works …
images for providing a geometric counterpart to the RGB representation. Most existing works …
Multiattention network for semantic segmentation of fine-resolution remote sensing images
Semantic segmentation of remote sensing images plays an important role in a wide range of
applications, including land resource management, biosphere monitoring, and urban …
applications, including land resource management, biosphere monitoring, and urban …
CPFNet: Context pyramid fusion network for medical image segmentation
Accurate and automatic segmentation of medical images is a crucial step for clinical
diagnosis and analysis. The convolutional neural network (CNN) approaches based on the …
diagnosis and analysis. The convolutional neural network (CNN) approaches based on the …
CTNet: Context-based tandem network for semantic segmentation
Contextual information has been shown to be powerful for semantic segmentation. This work
proposes a novel Context-based Tandem Network (CTNet) by interactively exploring the …
proposes a novel Context-based Tandem Network (CTNet) by interactively exploring the …
A survey on deep learning based approaches for scene understanding in autonomous driving
Z Guo, Y Huang, X Hu, H Wei, B Zhao - Electronics, 2021 - mdpi.com
As a prerequisite for autonomous driving, scene understanding has attracted extensive
research. With the rise of the convolutional neural network (CNN)-based deep learning …
research. With the rise of the convolutional neural network (CNN)-based deep learning …
Deep learning for understanding satellite imagery: An experimental survey
SP Mohanty, J Czakon, KA Kaczmarek… - Frontiers in Artificial …, 2020 - frontiersin.org
Translating satellite imagery into maps requires intensive effort and time, especially leading
to inaccurate maps of the affected regions during disaster and conflict. The combination of …
to inaccurate maps of the affected regions during disaster and conflict. The combination of …
Unsupervised domain adaptation for medical image segmentation by disentanglement learning and self-training
Unsupervised domain adaption (UDA), which aims to enhance the segmentation
performance of deep models on unlabeled data, has recently drawn much attention. In this …
performance of deep models on unlabeled data, has recently drawn much attention. In this …