Scene text detection and recognition: The deep learning era
With the rise and development of deep learning, computer vision has been tremendously
transformed and reshaped. As an important research area in computer vision, scene text …
transformed and reshaped. As an important research area in computer vision, scene text …
Text recognition in the wild: A survey
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …
information carried by text is important in a wide range of vision-based application …
Mpdiou: a loss for efficient and accurate bounding box regression
S Ma, Y Xu - arxiv preprint arxiv:2307.07662, 2023 - arxiv.org
Bounding box regression (BBR) has been widely used in object detection and instance
segmentation, which is an important step in object localization. However, most of the existing …
segmentation, which is an important step in object localization. However, most of the existing …
Git: A generative image-to-text transformer for vision and language
In this paper, we design and train a Generative Image-to-text Transformer, GIT, to unify
vision-language tasks such as image/video captioning and question answering. While …
vision-language tasks such as image/video captioning and question answering. While …
Real-time scene text detection with differentiable binarization and adaptive scale fusion
Recently, segmentation-based scene text detection methods have drawn extensive attention
in the scene text detection field, because of their superiority in detecting the text instances of …
in the scene text detection field, because of their superiority in detecting the text instances of …
Read like humans: Autonomous, bidirectional and iterative language modeling for scene text recognition
Linguistic knowledge is of great benefit to scene text recognition. However, how to effectively
model linguistic rules in end-to-end deep networks remains a research challenge. In this …
model linguistic rules in end-to-end deep networks remains a research challenge. In this …
Real-time scene text detection with differentiable binarization
Recently, segmentation-based methods are quite popular in scene text detection, as the
segmentation results can more accurately describe scene text of various shapes such as …
segmentation results can more accurately describe scene text of various shapes such as …
Deepsolo: Let transformer decoder with explicit points solo for text spotting
End-to-end text spotting aims to integrate scene text detection and recognition into a unified
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …
From two to one: A new scene text recognizer with visual language modeling network
In this paper, we abandon the dominant complex language model and rethink the linguistic
learning process in the scene text recognition. Different from previous methods considering …
learning process in the scene text recognition. Different from previous methods considering …
Character region awareness for text detection
Scene text detection methods based on neural networks have emerged recently and have
shown promising results. Previous methods trained with rigid word-level bounding boxes …
shown promising results. Previous methods trained with rigid word-level bounding boxes …