Extracting text from scanned Arabic books: a large-scale benchmark dataset and a fine-tuned Faster-R-CNN model

R Elanwar, W Qin, M Betke, D Wijaya - International Journal on Document …, 2021 - Springer
Datasets of documents in Arabic are urgently needed to promote computer vision and
natural language processing research that addresses the specifics of the language …

Arabic Documents Layout Analysis (ADLA) using Fine-tuned Faster RCN

L Aljiffry, H Al-Barhamtoshy, A Jamal… - 2022 20th …, 2022 - ieeexplore.ieee.org
At present, there is a massive interest in document digitization, image searching, and natural
language processing models, using different types of models. The first step in applying any …

The ASAR 2018 Competition on physical layout analysis of scanned Arabic books (PLA-SAB 2018)

R Elanwar, M Betke - … Workshop on Arabic and Derived Script …, 2018 - ieeexplore.ieee.org
Successful physical layout analysis (PLA) is a key factor in the performance of text
recognizers and many other applications. PLA solutions for scanned Arabic documents are …