Text detection, tracking and recognition in video: a comprehensive survey
The intelligent analysis of video data is currently in wide demand because a video is a major
source of sensory data in our lives. Text is a prominent and direct source of information in …
source of sensory data in our lives. Text is a prominent and direct source of information in …
From object detection to text detection and recognition: A brief evolution history of optical character recognition
H Wang, C Pan, X Guo, C Ji… - Wiley Interdisciplinary …, 2021 - Wiley Online Library
Text detection and recognition, which is also known as optical character recognition (OCR),
is an active research area under quick development with a lot of exciting applications. Deep …
is an active research area under quick development with a lot of exciting applications. Deep …
Roadtext-1k: Text detection & recognition dataset for driving videos
Perceiving text is crucial to understand semantics of outdoor scenes and hence is a critical
requirement to build intelligent systems for driver assistance and self-driving. Most of the …
requirement to build intelligent systems for driver assistance and self-driving. Most of the …
End-to-end video text detection with online tracking
Text in videos usually acts as important semantic cues, which is helpful to video analysis.
Video text detection is considered as one of the most difficult tasks in document analysis due …
Video text detection is considered as one of the most difficult tasks in document analysis due …
Semantic-aware video text detection
Most existing video text detection methods track texts with appearance features, which are
easily influenced by the change of perspective and illumination. Compared with appearance …
easily influenced by the change of perspective and illumination. Compared with appearance …
A bilingual, openworld video text dataset and end-to-end video text spotter with transformer
Most existing video text spotting benchmarks focus on evaluating a single language and
scenario with limited data. In this work, we introduce a large-scale, Bilingual, Open World …
scenario with limited data. In this work, we introduce a large-scale, Bilingual, Open World …
End-to-end video text spotting with transformer
Recent video text spotting methods usually require the three-staged pipeline, ie, detecting
text in individual images, recognizing localized text, tracking text streams with post …
text in individual images, recognizing localized text, tracking text streams with post …
Free: A fast and robust end-to-end video text spotter
Currently, video text spotting tasks usually fall into the four-staged pipeline: detecting text
regions in individual images, recognizing localized text regions frame-wisely, tracking text …
regions in individual images, recognizing localized text regions frame-wisely, tracking text …
A unified framework for tracking based text detection and recognition from web videos
S Tian, XC Yin, Y Su, HW Hao - IEEE transactions on pattern …, 2017 - ieeexplore.ieee.org
Video text extraction plays an important role for multimedia understanding and retrieval.
Most previous research efforts are conducted within individual frames. A few of recent …
Most previous research efforts are conducted within individual frames. A few of recent …
T-HOG: An effective gradient-based descriptor for single line text regions
We discuss the use of histogram of oriented gradients (HOG) descriptors as an effective tool
for text description and recognition. Specifically, we propose a HOG-based texture descriptor …
for text description and recognition. Specifically, we propose a HOG-based texture descriptor …