Transformers in remote sensing: A survey

AA Aleissaee, A Kumar, RM Anwer, S Khan… - Remote Sensing, 2023 - mdpi.com
Deep learning-based algorithms have seen a massive popularity in different areas of remote
sensing image analysis over the past decade. Recently, transformer-based architectures …

Scene text detection and recognition: The deep learning era

S Long, X He, C Yao - International Journal of Computer Vision, 2021 - Springer
With the rise and development of deep learning, computer vision has been tremendously
transformed and reshaped. As an important research area in computer vision, scene text …

Deepseek-vl: towards real-world vision-language understanding

H Lu, W Liu, B Zhang, B Wang, K Dong, B Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
We present DeepSeek-VL, an open-source Vision-Language (VL) Model designed for real-
world vision and language understanding applications. Our approach is structured around …

Learning high-precision bounding box for rotated object detection via kullback-leibler divergence

X Yang, X Yang, J Yang, Q Ming… - Advances in …, 2021 - proceedings.neurips.cc
Existing rotated object detectors are mostly inherited from the horizontal detection paradigm,
as the latter has evolved into a well-developed area. However, these detectors are difficult to …

Turning a clip model into a scene text detector

W Yu, Y Liu, W Hua, D Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent large-scale Contrastive Language-Image Pretraining (CLIP) model has shown
great potential in various downstream tasks via leveraging the pretrained vision and …

Phase-shifting coder: Predicting accurate orientation in oriented object detection

Y Yu, F Da - Proceedings of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
With the vigorous development of computer vision, oriented object detection has gradually
been featured. In this paper, a novel differentiable angle coder named phase-shifting coder …

Rethinking rotated object detection with gaussian wasserstein distance loss

X Yang, J Yan, Q Ming, W Wang… - … on machine learning, 2021 - proceedings.mlr.press
Boundary discontinuity and its inconsistency to the final detection metric have been the
bottleneck for rotating detection regression loss design. In this paper, we propose a novel …

Deepsolo: Let transformer decoder with explicit points solo for text spotting

M Ye, J Zhang, S Zhao, J Liu, T Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
End-to-end text spotting aims to integrate scene text detection and recognition into a unified
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …

The KFIoU loss for rotated object detection

X Yang, Y Zhou, G Zhang, J Yang, W Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
Differing from the well-developed horizontal object detection area whereby the computing-
friendly IoU based loss is readily adopted and well fits with the detection metrics. In contrast …

Arbitrary-oriented object detection with circular smooth label

X Yang, J Yan - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Arbitrary-oriented object detection has recently attracted increasing attention in vision for
their importance in aerial imagery, scene text, and face etc. In this paper, we show that …