RingMo-sense: Remote sensing foundation model for spatiotemporal prediction via spatiotemporal evolution disentangling
F Yao, W Lu, H Yang, L Xu, C Liu, L Hu… - … on Geoscience and …, 2023 - ieeexplore.ieee.org
Remote sensing (RS) spatiotemporal prediction aims to infer future trends from historical
spatiotemporal data, e.g., videos and time-series images, which has a broad application …
RingMo-SAM: A foundation model for segment anything in multimodal remote-sensing images
The proposal of the segment anything model (SAM) has created a new paradigm for the
deep-learning-based semantic segmentation field and has shown amazing generalization …
Causal adversarial autoencoder for disentangled SAR image representation and few-shot target recognition
Lack of interpretability and weak generalization ability have become the major challenges
with data-driven intelligent synthetic aperture radar-automatic target recognition (SAR-ATR) …
GABLE: A first fine-grained 3D building model of China on a national scale from very high resolution satellite imagery
Three-dimensional (3D) building models provide horizontal and vertical information
of urban development patterns, which are significant to urbanization analysis, solar energy …
Ucdnet: Multi-uav collaborative 3d object detection network by reliable feature mapping
P Tian, Z Wang, P Cheng, Y Wang… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Multi-unmanned aerial vehicle (UAV) collaborative 3-D object detection can comprehend
complex environments by integrating complementary information, with applications …
Retentive Compensation and Personality Filtering for Few-Shot Remote Sensing Object Detection
In recent years, few-shot object detection (FSOD) in remote sensing images has attracted
increasing attention. Numerous studies address the challenges posed by both intra-class …
Spatial guided image captioning: Guiding attention with object's spatial interaction
R Du, W Zhang, S Li, J Chen, Z Guo - IET Image Processing, 2024 - Wiley Online Library
Nowadays relational position embedding is widely used in many large multi‐modal models.
It begins with relational captioning (a branch of image captioning) and contains two …
Balancing Attention to Base and Novel Categories for Few-Shot Object Detection in Remote Sensing Imagery
Z Zhu, P Wang, W Diao, J Yang, L Kong… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Few-shot object detection (FSOD) has garnered widespread attention in recent years, which
makes it possible to learn novel classes with only a handful of labeled samples. Due to the …
PW-MFL: Promoting Semantic Segmentation in Resolution-Degraded Aerial Images via Pixel-Wise Mutual-Feed Learning
J Yang, Y Wu, W Dai, W Diao, Z Zhu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Due to variable imaging conditions, resolution degradation often occurs in aerial images,
which in turn impairs the performance upper bound of semantic segmentation (SS). To solve …
RingMo-Galaxy: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks
Remote sensing lightweight foundation models have successfully achieved online
perception, providing real-time intelligent interpretation. However, their capabilities are …