RSPrompter: Learning to prompt for remote sensing instance segmentation based on visual foundation model

K Chen, C Liu, H Chen, H Zhang, W Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Leveraging the extensive training data from SA-1B, the segment anything model (SAM)
demonstrates remarkable generalization and zero-shot capabilities. However, as a category …

Change-agent: Towards interactive comprehensive remote sensing change interpretation and analysis

C Liu, K Chen, H Zhang, Z Qi, Z Zou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Monitoring changes in the Earth's surface is crucial for understanding natural processes and
human impacts, necessitating precise and comprehensive interpretation methodologies …

Semantic-cc: Boosting remote sensing image change captioning via foundational knowledge and semantic guidance

Y Zhu, L Li, K Chen, C Liu, F Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Remote sensing image change captioning (RSICC) aims to articulate the changes in objects
of interest within bitemporal remote sensing images using natural language. Given the …

Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey

C Liu, J Zhang, K Chen, M Wang, Z Zou… - arxiv preprint arxiv …, 2024 - arxiv.org
Temporal image analysis in remote sensing has traditionally centered on change detection,
which identifies regions of change between images captured at different times. However …

[HTML][HTML] A Review of Deep Learning-Based Remote Sensing Image Caption: Methods, Models, Comparisons and Future Directions

K Zhang, P Li, J Wang - Remote Sensing, 2024 - mdpi.com
Remote sensing images contain a wealth of Earth-observation information. Efficient
extraction and application of hidden knowledge from these images will greatly promote the …

Ringmogpt: A unified remote sensing foundation model for vision, language, and grounded tasks

P Wang, H Hu, B Tong, Z Zhang, F Yao… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Recently, multi-modal large language models (MLLMs) have shown excellent reasoning
capabilities in various fields. Most of the existing remote sensing MLLMs solve image-level …

Ccexpert: Advancing mllm capability in remote sensing change captioning with difference-aware integration and a foundational dataset

Z Wang, M Wang, S Xu, Y Li, B Zhang - arxiv preprint arxiv:2411.11360, 2024 - arxiv.org
Remote Sensing Image Change Captioning (RSICC) aims to generate natural language
descriptions of surface changes between multi-temporal remote sensing images, detailing …

Mv-cc: Mask enhanced video model for remote sensing change caption

R Liu, K Li, J Song, D Sun, X Cao - arxiv preprint arxiv:2410.23946, 2024 - arxiv.org
Remote sensing image change caption (RSICC) aims to provide natural language
descriptions for bi-temporal remote sensing images. Since Change Caption (CC) task …

[HTML][HTML] Dual-Branch Seasonal Error Elimination Change Detection Framework Using Target Image Feature Fusion Generator

H Zhu, J Zhang, Z Wang, X Liu, Q Liu, B Du - Remote Sensing, 2025 - mdpi.com
In change detection tasks, seasonal variations in spectral characteristics and surface cover
can negatively impact performance when comparing image pairs from different seasons …

Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation

Z Qi, H Chen, H Zhang, Z Zou, Z Shi - arxiv preprint arxiv:2412.05969, 2024 - arxiv.org
In this paper, we propose a novel semantic splatting approach based on Gaussian Splatting
to achieve efficient and low-latency. Our method projects the RGB attributes and semantic …