Trocr: Transformer-based optical character recognition with pre-trained models

M Li, T Lv, J Chen, L Cui, Y Lu, D Florencio… - Proceedings of the …, 2023 - ojs.aaai.org
Text recognition is a long-standing research problem for document digitalization. Existing
approaches are usually built based on CNN for image understanding and RNN for char …

Dtrocr: Decoder-only transformer for optical character recognition

M Fujitake - Proceedings of the IEEE/CVF winter conference …, 2024 - openaccess.thecvf.com
Typical text recognition methods rely on an encoder-decoder structure, in which the encoder
extracts features from an image, and the decoder produces recognized text from these …

Dan: a segmentation-free document attention network for handwritten document recognition

D Coquenet, C Chatelain… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
Unconstrained handwritten text recognition is a challenging computer vision task. It is
traditionally handled by a two-step approach, combining line segmentation followed by text …

Data augmentation for offline handwritten text recognition: a systematic literature review

AF de Sousa Neto, BLD Bezerra, GCD de Moura… - SN Computer …, 2024 - Springer
Abstract Offline Handwritten Text Recognition (HTR) systems concern the automatic
recognition and transcription of handwritten text from scanned images to digital media …

Handwritten mathematical expression recognition with bidirectionally trained transformer

W Zhao, L Gao, Z Yan, S Peng, L Du… - Document analysis and …, 2021 - Springer
Encoder-decoder models have made great progress on handwritten mathematical
expression recognition recently. However, it is still a challenge for existing methods to …

Digitizing history: transitioning historical paper documents to digital content for information retrieval and mining—a comprehensive survey

N Girdhar, M Coustaty, A Doucet - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Historical document processing (HDP) corresponds to the task of converting the physical-
bind form of historical archives into a web-based centrally digitized form for their …

Content and style aware generation of text-line images for handwriting recognition

L Kang, P Riba, M Rusinol, A Fornes… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Handwritten Text Recognition has achieved an impressive performance in public
benchmarks. However, due to the high inter-and intra-class variability between handwriting …

Transformer-based approach for joint handwriting and named entity recognition in historical document

AC Rouhou, M Dhiaf, Y Kessentini, SB Salem - Pattern Recognition Letters, 2022 - Elsevier
The extraction of relevant information carried out by named entities in handwriting
documents is still a challenging task. Unlike traditional information extraction approaches …

Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applications

M Durve, S Orsini, A Tiribocchi, A Montessori… - The European Physical …, 2023 - Springer
Tracking droplets in microfluidics is a challenging task. The difficulty arises in choosing a
tool to analyze general microfluidic videos to infer physical quantities. The state-of-the-art …

Transformer for handwritten text recognition using bidirectional post-decoding

C Wick, J Zöllner, T Grüning - International Conference on Document …, 2021 - Springer
Most recently, Transformers–which are recurrent-free neural network architectures–
achieved tremendous performances on various Natural Language Processing (NLP) tasks …