Degraded historical document binarization: A review on issues, challenges, techniques, and future directions
In this era of digitization, most hardcopy documents are being transformed into digital
formats. In the process of transformation, large quantities of documents are stored and …
formats. In the process of transformation, large quantities of documents are stored and …
Historical document image binarization: A review
This review provides a comprehensive view of the field of historical document image
binarization with a focus on the contributions made in the last decade. After the introduction …
binarization with a focus on the contributions made in the last decade. After the introduction …
Document image binarization with fully convolutional neural networks
Binarization of degraded historical manuscript images is an important pre-processing step
for many document processing tasks. We formulate binarization as a pixel classification …
for many document processing tasks. We formulate binarization as a pixel classification …
Docentr: An end-to-end document image enhancement transformer
Document images can be affected by many degradation scenarios, which cause recognition
and processing difficulties. In this age of digitization, it is important to denoise them for …
and processing difficulties. In this age of digitization, it is important to denoise them for …
Enhance to read better: a multi-task adversarial network for handwritten document image enhancement
Handwritten document images can be highly affected by degradation for different reasons:
Paper ageing, daily-life scenarios (wrinkles, dust, etc.), bad scanning process and so on …
Paper ageing, daily-life scenarios (wrinkles, dust, etc.), bad scanning process and so on …
A selectional auto-encoder approach for document image binarization
Binarization plays a key role in the automatic information retrieval from document images.
This process is usually performed in the first stages of document analysis systems, and …
This process is usually performed in the first stages of document analysis systems, and …
A survey of historical document image datasets
This paper presents a systematic literature review of image datasets for document image
analysis, focusing on historical documents, such as handwritten manuscripts and early …
analysis, focusing on historical documents, such as handwritten manuscripts and early …
Autopart: Automating schema design for large scientific databases using data partitioning
S Papadomanolakis, A Ailamaki - … International Conference on …, 2004 - ieeexplore.ieee.org
Database applications that use multi-terabyte datasets are becoming increasingly important
for scientific fields such as astronomy and biology. Scientific databases are particularly …
for scientific fields such as astronomy and biology. Scientific databases are particularly …
Text-DIAE: a self-supervised degradation invariant autoencoder for text recognition and document enhancement
In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-
supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) …
supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) …
Generate, transform, and clean: the role of GANs and transformers in palm leaf manuscript generation and enhancement
Palm leaf manuscripts offer a rich source of data critical for document analysis tasks,
including character, word, and text analysis. However, their cleaning and denoising present …
including character, word, and text analysis. However, their cleaning and denoising present …