HGR-Net: Hierarchical graph reasoning network for arbitrary shape scene text detection

H Bi, C Xu, C Shi, G Liu, H Zhang, Y Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
As a prerequisite step of scene text reading, scene text detection is known as a challenging
task due to natural scene text diversity and variability. Most existing methods either adopt …

A hybrid approach to document layout analysis for heterogeneous document images

Z Zhong, J Wang, H Sun, K Hu, E Zhang, L Sun… - … on Document Analysis …, 2023 - Springer
We present a new hybrid document layout analysis approach to simultaneously detecting
graphical page objects, group text-lines into text regions according to reading order, and …

LayoutFormer: Hierarchical Text Detection Towards Scene Text Understanding

M Liang, JW Ma, X Zhu, J Qin… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Existing scene text detectors generally focus on accurately detecting single-level (ie word-
level line-level or paragraph-level) text entities without exploring the relationships among …

Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis

J Wang, K Hu, Z Zhong, L Sun, Q Huo - arxiv preprint arxiv:2401.11874, 2024 - arxiv.org
Document structure analysis (aka document layout analysis) is crucial for understanding the
physical layout and logical structure of documents, with applications in information retrieval …

Dynamic Relation Transformer for Contextual Text Block Detection

J Wang, S Zhang, K Hu, C Ma, Z Zhong, L Sun… - … on Document Analysis …, 2024 - Springer
Abstract Contextual Text Block Detection (CTBD) is the task of identifying coherent text
blocks in complex natural scenes. Previous methodologies have treated CTBD as either a …

Arbitrary Shape Text Detection With Discrete Cosine Transform and CLIP for Urban Scene Perception in ITS

Z Chen - IEEE Transactions on Intelligent Transportation …, 2025 - ieeexplore.ieee.org
The safety and reliability of urban intelligent transportation systems (ITS) largely depend on
accurate and sufficient scene perception, especially detecting and understanding text …

DISGO: Automatic End-to-End Evaluation for Scene Text OCR

MY Hwang, Y Shi, A Ramchandani, G Pang… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper discusses the challenges of optical character recognition (OCR) on natural
scenes, which is harder than OCR on documents due to the wild content and various image …

Block-level Text Spotting with LLMs

G Bannur, B Amrutur - arxiv preprint arxiv:2406.13208, 2024 - arxiv.org
Text spotting has seen tremendous progress in recent years yielding performant techniques
which can extract text at the character, word or line level. However, extracting blocks of text …