HGR-Net: Hierarchical graph reasoning network for arbitrary shape scene text detection
As a prerequisite step of scene text reading, scene text detection is known as a challenging
task due to natural scene text diversity and variability. Most existing methods either adopt …
task due to natural scene text diversity and variability. Most existing methods either adopt …
A hybrid approach to document layout analysis for heterogeneous document images
We present a new hybrid document layout analysis approach to simultaneously detecting
graphical page objects, group text-lines into text regions according to reading order, and …
graphical page objects, group text-lines into text regions according to reading order, and …
LayoutFormer: Hierarchical Text Detection Towards Scene Text Understanding
Existing scene text detectors generally focus on accurately detecting single-level (ie word-
level line-level or paragraph-level) text entities without exploring the relationships among …
level line-level or paragraph-level) text entities without exploring the relationships among …
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis
Document structure analysis (aka document layout analysis) is crucial for understanding the
physical layout and logical structure of documents, with applications in information retrieval …
physical layout and logical structure of documents, with applications in information retrieval …
Dynamic Relation Transformer for Contextual Text Block Detection
Abstract Contextual Text Block Detection (CTBD) is the task of identifying coherent text
blocks in complex natural scenes. Previous methodologies have treated CTBD as either a …
blocks in complex natural scenes. Previous methodologies have treated CTBD as either a …
Arbitrary Shape Text Detection With Discrete Cosine Transform and CLIP for Urban Scene Perception in ITS
Z Chen - IEEE Transactions on Intelligent Transportation …, 2025 - ieeexplore.ieee.org
The safety and reliability of urban intelligent transportation systems (ITS) largely depend on
accurate and sufficient scene perception, especially detecting and understanding text …
accurate and sufficient scene perception, especially detecting and understanding text …
DISGO: Automatic End-to-End Evaluation for Scene Text OCR
This paper discusses the challenges of optical character recognition (OCR) on natural
scenes, which is harder than OCR on documents due to the wild content and various image …
scenes, which is harder than OCR on documents due to the wild content and various image …
Block-level Text Spotting with LLMs
G Bannur, B Amrutur - arxiv preprint arxiv:2406.13208, 2024 - arxiv.org
Text spotting has seen tremendous progress in recent years yielding performant techniques
which can extract text at the character, word or line level. However, extracting blocks of text …
which can extract text at the character, word or line level. However, extracting blocks of text …