Generative artificial intelligence: a systematic review and applications

SS Sengar, AB Hasan, S Kumar, F Carroll - Multimedia Tools and …, 2024 - Springer
In recent years, the study of artificial intelligence (AI) has undergone a paradigm shift. This
has been propelled by the groundbreaking capabilities of generative models both in …

Deepsolo: Let transformer decoder with explicit points solo for text spotting

M Ye, J Zhang, S Zhao, J Liu, T Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
End-to-end text spotting aims to integrate scene text detection and recognition into a unified
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …

Swintextspotter: Scene text spotting via better synergy between text detection and text recognition

M Huang, Y Liu, Z Peng, C Liu, D Lin… - proceedings of the …, 2022 - openaccess.thecvf.com
End-to-end scene text spotting has attracted great attention in recent years due to the
success of excavating the intrinsic synergy of the scene text detection and recognition …

Text spotting transformers

X Zhang, Y Su, S Tripathi, Z Tu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
In this paper, we present TExt Spotting TRansformers (TESTR), a generic end-to-end text
spotting framework using Transformers for text detection and recognition in the wild. TESTR …

Omniparser: A unified framework for text spotting key information extraction and table recognition

J Wan, S Song, W Yu, Y Liu, W Cheng… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently visually-situated text parsing (VsTP) has experienced notable advancements
driven by the increasing demand for automated document understanding and the …

Estextspotter: Towards better scene text spotting with explicit synergy in transformer

M Huang, J Zhang, D Peng, H Lu… - Proceedings of the …, 2023 - openaccess.thecvf.com
In recent years, end-to-end scene text spotting approaches are evolving to the Transformer-
based framework. While previous studies have shown the crucial importance of the intrinsic …

Structext: Structured text understanding with multi-modal transformers

Y Li, Y Qian, Y Yu, X Qin, C Zhang, Y Liu… - Proceedings of the 29th …, 2021 - dl.acm.org
Structured text understanding on Visually Rich Documents (VRDs) is a crucial part of
Document Intelligence. Due to the complexity of content and layout in VRDs, structured text …

Abinet++: Autonomous, bidirectional and iterative language modeling for scene text spotting

S Fang, Z Mao, H **e, Y Wang, C Yan… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Scene text spotting is of great importance to the computer vision community due to its wide
variety of applications. Recent methods attempt to introduce linguistic knowledge for …

Spts v2: single-point scene text spotting

Y Liu, J Zhang, D Peng, M Huang… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
End-to-end scene text spotting has made significant progress due to its intrinsic synergy
between text detection and recognition. Previous methods commonly regard manual …

MMOCR: a comprehensive toolbox for text detection, recognition and understanding

Z Kuang, H Sun, Z Li, X Yue, TH Lin, J Chen… - Proceedings of the 29th …, 2021 - dl.acm.org
We present MMOCR---an open-source toolbox which provides a comprehensive pipeline for
text detection and recognition, as well as their downstream tasks such as named entity …