Scene text detection and recognition: The deep learning era

S Long, X He, C Yao - International Journal of Computer Vision, 2021 - Springer
With the rise and development of deep learning, computer vision has been tremendously
transformed and reshaped. As an important research area in computer vision, scene text …

Text recognition in the wild: A survey

X Chen, L Jin, Y Zhu, C Luo, T Wang - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …

Stable video diffusion: Scaling latent video diffusion models to large datasets

A Blattmann, T Dockhorn, S Kulal… - arXiv preprint arXiv …, 2023 - arxiv.org
We present Stable Video Diffusion, a latent video diffusion model for high-resolution,
state-of-the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained …

Real-time scene text detection with differentiable binarization and adaptive scale fusion

M Liao, Z Zou, Z Wan, C Yao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Recently, segmentation-based scene text detection methods have drawn extensive attention
in the scene text detection field, because of their superiority in detecting the text instances of …

OCR-free document understanding transformer

G Kim, T Hong, M Yim, JY Nam, J Park, J Yim… - … on Computer Vision, 2022 - Springer
Understanding document images (e.g., invoices) is a core but challenging task since it
requires complex functions such as reading text and a holistic understanding of the …

Fourier contour embedding for arbitrary-shaped text detection

Y Zhu, J Chen, L Liang, Z Kuang… - Proceedings of the …, 2021 - openaccess.thecvf.com
One of the main challenges for arbitrary-shaped text detection is to design a good text
instance representation that allows networks to learn diverse text geometry variances. Most …

Real-time scene text detection with differentiable binarization

M Liao, Z Wan, C Yao, K Chen, X Bai - Proceedings of the AAAI …, 2020 - ojs.aaai.org
Recently, segmentation-based methods are quite popular in scene text detection, as the
segmentation results can more accurately describe scene text of various shapes such as …

Gliding vertex on the horizontal bounding box for multi-oriented object detection

Y Xu, M Fu, Q Wang, Y Wang, K Chen… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Object detection has recently experienced substantial progress. Yet, the widely adopted
horizontal bounding box representation is not appropriate for ubiquitous oriented objects …

ToonCrafter: Generative cartoon interpolation

J Xing, H Liu, M Xia, Y Zhang, X Wang, Y Shan… - ACM Transactions on …, 2024 - dl.acm.org
We introduce ToonCrafter, a novel approach that transcends traditional correspondence-
based cartoon video interpolation, paving the way for generative interpolation. Traditional …

ABCNet: Real-time scene text spotting with adaptive Bezier-curve network

Y Liu, H Chen, C Shen, T He, L Jin… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text detection and recognition has received increasing research attention. Existing
methods can be roughly categorized into two groups: character-based and segmentation …