Scene text detection and recognition: The deep learning era

S Long, X He, C Yao - International Journal of Computer Vision, 2021 - Springer
With the rise and development of deep learning, computer vision has been tremendously
transformed and reshaped. As an important research area in computer vision, scene text …

Text recognition in the wild: A survey

X Chen, L **, Y Zhu, C Luo, T Wang - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …

Scene text recognition with permuted autoregressive sequence models

D Bautista, R Atienza - European conference on computer vision, 2022 - Springer
Context-aware STR methods typically use internal autoregressive (AR) language models
(LM). Inherent limitations of AR models motivated two-stage methods which employ an …

Read like humans: Autonomous, bidirectional and iterative language modeling for scene text recognition

S Fang, H **e, Y Wang, Z Mao… - Proceedings of the …, 2021 - openaccess.thecvf.com
Linguistic knowledge is of great benefit to scene text recognition. However, how to effectively
model linguistic rules in end-to-end deep networks remains a research challenge. In this …

Towards accurate scene text recognition with semantic reasoning networks

D Yu, X Li, C Zhang, T Liu, J Han… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text image contains two levels of contents: visual texture and semantic information.
Although the previous scene text recognition methods have made great progress over the …

What is wrong with scene text recognition model comparisons? dataset and model analysis

J Baek, G Kim, J Lee, S Park, D Han… - Proceedings of the …, 2019 - openaccess.thecvf.com
Many new proposals for scene text recognition (STR) models have been introduced in
recent years. While each claim to have pushed the boundary of the technology, a holistic …

From two to one: A new scene text recognizer with visual language modeling network

Y Wang, H **e, S Fang, J Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we abandon the dominant complex language model and rethink the linguistic
learning process in the scene text recognition. Different from previous methods considering …

Seed: Semantics enhanced encoder-decoder framework for scene text recognition

Z Qiao, Y Zhou, D Yang, Y Zhou… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text recognition is a hot research topic in computer vision. Recently, many recognition
methods based on the encoder-decoder framework have been proposed, and they can …

Mask textspotter: An end-to-end trainable neural network for spotting text with arbitrary shapes

P Lyu, M Liao, C Yao, W Wu… - Proceedings of the …, 2018 - openaccess.thecvf.com
Recently, models based on deep neural networks have dominated the fields of scene text
detection and recognition. In this paper, we investigate the problem of scene text spotting …

Pan++: Towards efficient and accurate end-to-end spotting of arbitrarily-shaped text

W Wang, E **e, X Li, X Liu, D Liang… - … on Pattern Analysis …, 2021 - ieeexplore.ieee.org
Scene text detection and recognition have been well explored in the past few years. Despite
the progress, efficient and accurate end-to-end spotting of arbitrarily-shaped text remains …