Scene text detection and recognition: The deep learning era

S Long, X He, C Yao - International Journal of Computer Vision, 2021 - Springer
With the rise and development of deep learning, computer vision has been tremendously
transformed and reshaped. As an important research area in computer vision, scene text …

Text recognition in the wild: A survey

X Chen, L Jin, Y Zhu, C Luo, T Wang - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …

TrOCR: Transformer-based optical character recognition with pre-trained models

M Li, T Lv, J Chen, L Cui, Y Lu, D Florencio… - Proceedings of the …, 2023 - ojs.aaai.org
Text recognition is a long-standing research problem for document digitalization. Existing
approaches are usually built based on CNN for image understanding and RNN for char …

Read like humans: Autonomous, bidirectional and iterative language modeling for scene text recognition

S Fang, H Xie, Y Wang, Z Mao… - Proceedings of the …, 2021 - openaccess.thecvf.com
Linguistic knowledge is of great benefit to scene text recognition. However, how to effectively
model linguistic rules in end-to-end deep networks remains a research challenge. In this …

From two to one: A new scene text recognizer with visual language modeling network

Y Wang, H Xie, S Fang, J Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we abandon the dominant complex language model and rethink the linguistic
learning process in the scene text recognition. Different from previous methods considering …

Towards accurate scene text recognition with semantic reasoning networks

D Yu, X Li, C Zhang, T Liu, J Han… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text image contains two levels of contents: visual texture and semantic information.
Although the previous scene text recognition methods have made great progress over the …

Unbalanced feature transport for exemplar-based image translation

F Zhan, Y Yu, K Cui, G Zhang, S Lu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Despite the great success of GANs in images translation with different conditioned inputs
such as semantic segmentation and edge map, generating high-fidelity images with …

Diverse image inpainting with bidirectional and autoregressive transformers

Y Yu, F Zhan, R Wu, J Pan, K Cui, S Lu, F Ma… - Proceedings of the 29th …, 2021 - dl.acm.org
Image inpainting is an underdetermined inverse problem, which naturally allows diverse
contents to fill up the missing or corrupted regions realistically. Prevalent approaches using …

SEED: Semantics enhanced encoder-decoder framework for scene text recognition

Z Qiao, Y Zhou, D Yang, Y Zhou… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text recognition is a hot research topic in computer vision. Recently, many recognition
methods based on the encoder-decoder framework have been proposed, and they can …

Decoupled attention network for text recognition

T Wang, Y Zhu, L Jin, C Luo, X Chen, Y Wu… - Proceedings of the …, 2020 - ojs.aaai.org
Text recognition has attracted considerable research interests because of its various
applications. The cutting-edge text recognition methods are based on attention mechanisms …