Multimodal feature-guided pre-training for RGB-T perception

J Ouyang, P **, Q Wang - IEEE Journal of Selected Topics in …, 2024 - ieeexplore.ieee.org
Wide-range multiscale object detection for multispectral scene perception from a drone
perspective is challenging. Previous RGB-T perception methods directly use backbone …

Feature guided masked autoencoder for self-supervised learning in remote sensing

Y Wang, HH Hernández, CM Albrecht… - IEEE Journal of …, 2024 - ieeexplore.ieee.org
Self-supervised learning guided by masked image modeling, such as masked autoencoder
(MAE), has attracted wide attention for pretraining vision transformers in remote sensing …

Joint classification of hyperspectral image and lidar data based on spectral prompt tuning

Y Kong, Y Cheng, Y Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The pretrained vision-language models (VLMs) have achieved outstanding performance in
various visual tasks, primarily due to the knowledge they have acquired from massive image …

[HTML][HTML] Real-time wildfire monitoring using low-Altitude Remote sensing imagery

H Tong, J Yuan, J Zhang, H Wang, T Li - Remote Sensing, 2024 - mdpi.com
With rising global temperatures, wildfires frequently occur worldwide during the summer
season. The timely detection of these fires, based on unmanned aerial vehicle (UAV) …

SDDiff: Semi-supervised surface defect detection with Diffusion Probabilistic Model

X Wang, W Li, L Cui, N Ouyang - Measurement, 2024 - Elsevier
Vision-based deep-learning methods are widely employed for Surface Defect Detection
(SDD). However, as learning often needs extensive high-precision annotations, the practical …

A Convolution with Transformer Attention Module Integrating Local and Global Features for Object Detection in Remote Sensing Based on YOLOv8n

K Lang, J Cui, M Yang, H Wang, Z Wang, H Shen - Remote Sensing, 2024 - mdpi.com
Object detection in remote sensing scenarios plays an indispensable and significant role in
civilian, commercial, and military areas, leveraging the power of convolutional neural …

[HTML][HTML] MGCET: MLP-mixer and Graph Convolutional Enhanced Transformer for Hyperspectral Image Classification

MAA Al-qaness, G Wu, D AL-Alimi - Remote Sensing, 2024 - mdpi.com
The vision transformer (ViT) has demonstrated performance comparable to that of
convolutional neural networks (CNN) in the hyperspectral image classification domain. This …

HSIMAE: A Unified Masked Autoencoder with Large-scale Pre-training for Hyperspectral Image Classification

Y Wang, M Wen, H Zhang, J Sun… - IEEE Journal of …, 2024 - ieeexplore.ieee.org
With a spurt of progress in deep learning techniques, convolutional neural network-based
and transformer-based methods have yielded impressive performance on the hyperspectral …

MHFNet: An improved HGR multimodal network for informative correlation fusion in remote sensing image classification

H Zhang, SL Huang… - IEEE Journal of Selected …, 2024 - ieeexplore.ieee.org
In the realm of urban development, the precise classification and identification of land types
are crucial for improving land use efficiency. This article proposes a land recognition and …

BiMAE-A Bimodal Masked Autoencoder Architecture for Single-Label Hyperspectral Image Classification

M Kukushkin, M Bogdan… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Hyperspectral imaging offers manifold opportunities for applications that may not or only
partially be achieved within the visual spectrum. Our paper presents a novel approach for …