RingMo-sense: Remote sensing foundation model for spatiotemporal prediction via spatiotemporal evolution disentangling

F Yao, W Lu, H Yang, L Xu, C Liu, L Hu… - … on Geoscience and …, 2023 - ieeexplore.ieee.org
Remote sensing (RS) spatiotemporal prediction aims to infer future trends from historical
spatiotemporal data, e.g., videos and time-series images, which has a broad application …

RingMo-SAM: A foundation model for segment anything in multimodal remote-sensing images

Z Yan, J Li, X Li, R Zhou, W Zhang… - … on Geoscience and …, 2023 - ieeexplore.ieee.org
The proposal of the segment anything model (SAM) has created a new paradigm for the
deep-learning-based semantic segmentation field and has shown amazing generalization …

Causal adversarial autoencoder for disentangled SAR image representation and few-shot target recognition

Q Guo, H Xu, F Xu - IEEE Transactions on Geoscience and …, 2023 - ieeexplore.ieee.org
Lack of interpretability and weak generalization ability have become the major challenges
with data-driven intelligent synthetic aperture radar-automatic target recognition (SAR-ATR) …

GABLE: A first fine-grained 3D building model of China on a national scale from very high resolution satellite imagery

X Sun, X Huang, Y Mao, T Sheng, J Li, Z Wang… - Remote Sensing of …, 2024 - Elsevier
Three-dimensional (3D) building models provide horizontal and vertical information
of urban development patterns, which are significant to urbanization analysis, solar energy …

UCDNet: Multi-UAV collaborative 3D object detection network by reliable feature mapping

P Tian, Z Wang, P Cheng, Y Wang… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Multi-unmanned aerial vehicle (UAV) collaborative 3-D object detection can comprehend
complex environments by integrating complementary information, with applications …

Retentive Compensation and Personality Filtering for Few-Shot Remote Sensing Object Detection

J Wu, C Lang, G Cheng, X Xie… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In recent years, few-shot object detection (FSOD) in remote sensing images has attracted
increasing attention. Numerous studies address the challenges posed by both intra-class …

Spatial guided image captioning: Guiding attention with object's spatial interaction

R Du, W Zhang, S Li, J Chen, Z Guo - IET Image Processing, 2024 - Wiley Online Library
Nowadays relational position embedding is widely used in many large multi‐modal models.
It begins with relational captioning (a branch of image captioning) and contains two …

Balancing Attention to Base and Novel Categories for Few-Shot Object Detection in Remote Sensing Imagery

Z Zhu, P Wang, W Diao, J Yang, L Kong… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Few-shot object detection (FSOD) has garnered widespread attention in recent years, which
makes it possible to learn novel classes with only a handful of labeled samples. Due to the …

PW-MFL: Promoting Semantic Segmentation in Resolution-Degraded Aerial Images via Pixel-Wise Mutual-Feed Learning

J Yang, Y Wu, W Dai, W Diao, Z Zhu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Due to variable imaging conditions, resolution degradation often occurs in aerial images,
which in turn impairs the performance upper bound of semantic segmentation (SS). To solve …

RingMo-Galaxy: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks

Z Wang, Z Wang, P Cheng, L Zhao… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Remote sensing lightweight foundation models have successfully achieved online
perception, providing real-time intelligent interpretation. However, their capabilities are …