Transformers in remote sensing: A survey

AA Aleissaee, A Kumar, RM Anwer, S Khan… - Remote Sensing, 2023 - mdpi.com
Deep learning-based algorithms have seen a massive popularity in different areas of remote
sensing image analysis over the past decade. Recently, transformer-based architectures …

Scene text detection and recognition: The deep learning era

S Long, X He, C Yao - International Journal of Computer Vision, 2021 - Springer
With the rise and development of deep learning, computer vision has been tremendously
transformed and reshaped. As an important research area in computer vision, scene text …

Git: A generative image-to-text transformer for vision and language

J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu… - arxiv preprint arxiv …, 2022 - arxiv.org
In this paper, we design and train a Generative Image-to-text Transformer, GIT, to unify
vision-language tasks such as image/video captioning and question answering. While …

Lvlm-ehub: A comprehensive evaluation benchmark for large vision-language models

P Xu, W Shao, K Zhang, P Gao, S Liu… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Large Vision-Language Models (LVLMs) have recently played a dominant role in
multimodal vision-language learning. Despite the great success, it lacks a holistic evaluation …

DOTA: A large-scale dataset for object detection in aerial images

GS **a, X Bai, J Ding, Z Zhu… - Proceedings of the …, 2018 - openaccess.thecvf.com
Object detection is an important and challenging problem in computer vision. Although the
past decade has witnessed major advances in object detection in natural scenes, such …

East: an efficient and accurate scene text detector

X Zhou, C Yao, H Wen, Y Wang… - Proceedings of the …, 2017 - openaccess.thecvf.com
Previous approaches for scene text detection have already achieved promising
performances across various benchmarks. However, they usually fall short when dealing …

Real-time scene text detection with differentiable binarization and adaptive scale fusion

M Liao, Z Zou, Z Wan, C Yao… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Recently, segmentation-based scene text detection methods have drawn extensive attention
in the scene text detection field, because of their superiority in detecting the text instances of …

Rethinking rotated object detection with gaussian wasserstein distance loss

X Yang, J Yan, Q Ming, W Wang… - … on machine learning, 2021 - proceedings.mlr.press
Boundary discontinuity and its inconsistency to the final detection metric have been the
bottleneck for rotating detection regression loss design. In this paper, we propose a novel …

Character region awareness for text detection

Y Baek, B Lee, D Han, S Yun… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Scene text detection methods based on neural networks have emerged recently and have
shown promising results. Previous methods trained with rigid word-level bounding boxes …

Trocr: Transformer-based optical character recognition with pre-trained models

M Li, T Lv, J Chen, L Cui, Y Lu, D Florencio… - Proceedings of the …, 2023 - ojs.aaai.org
Text recognition is a long-standing research problem for document digitalization. Existing
approaches are usually built based on CNN for image understanding and RNN for char …