Text line and word segmentation of handwritten documents

G Louloudis, B Gatos, I Pratikakis, C Halatsis - Pattern recognition, 2009 - Elsevier
In this paper, we present a segmentation methodology of handwritten documents in their
distinct entities, namely, text lines and words. Text line segmentation is achieved by applying …

Printed Ottoman text recognition using synthetic data and data augmentation

EF Bilgin Tasdemir - International Journal on Document Analysis and …, 2023 - Springer
The Ottoman script, which was in use for over five centuries, is an Arabic alphabet-based
writing system. It became obsolete after the change of alphabet in Turkey. There are plenty …

Matching word images for content-based retrieval from printed document images

M Meshesha, CV Jawahar - International Journal of Document Analysis …, 2008 - Springer
As large quantity of document images is getting archived by the digital libraries, there is a
need for an efficient search strategies to make them available as per users information need …

SHIBR—The Swedish historical birth records: A semi-annotated dataset

A Cheddad, H Kusetogullari, A Hilmkil, L Sundin… - Neural Computing and …, 2021 - Springer
This paper presents a digital image dataset of historical handwritten birth records stored in
the archives of several parishes across Sweden, together with the corresponding metadata …

GAN-based text line segmentation method for challenging handwritten documents

İ Özşeker, AA Demir, U Özkaya - International Journal on Document …, 2024 - Springer
Text line segmentation (TLS) is an essential step of the end-to-end document analysis
systems. The main purpose of this step is to extract the individual text lines of any …

Matching ottoman words: an image retrieval approach to historical document indexing

E Ataer, P Duygulu - Proceedings of the 6th ACM International …, 2007 - dl.acm.org
Large archives of Ottoman documents are challenging to many historians all over the world.
However, these archives remain inaccessible since manual transcription of such a huge …

HAH manuscripts: A holistic paradigm for classifying and retrieving historical Arabic handwritten documents

Z Al Aghbari, S Brook - Expert Systems with Applications, 2009 - Elsevier
Technologies for reading and searching digital documents have helped academic
researchers; however, truly effective search engines for handwritten documents have not …

Efficient search in document image collections

A Kumar, CV Jawahar, R Manmatha - … 18-22, 2007, Proceedings, Part I 8, 2007 - Springer
This paper presents an efficient indexing and retrieval scheme for searching in document
image databases. In many non-European languages, optical character recognizers are not …

Efficient algorithms for text lines and words segmentation for recognition of Arabic handwritten script

AAA Ali, M Suresha - … and Applications: ERCICA 2018, Volume 1, 2019 - Springer
A new methodology for Arabic handwritten document images segmentation is done in this
paper to segment the documents into distinct entities as words and text lines. Based on …

A line-based representation for matching words in historical manuscripts

EF Can, P Duygulu - Pattern Recognition Letters, 2011 - Elsevier
In this study, we propose a new method for retrieving and recognizing words in historical
documents. We represent word images with a set of line segments. Then we provide a …