A survey of OCR in Arabic language: applications, techniques, and challenges

S Faizullah, MS Ayub, S Hussain, MA Khan - Applied Sciences, 2023 - mdpi.com
Optical character recognition (OCR) is the process of extracting handwritten or printed text
from a scanned or printed image and converting it to a machine-readable form for further …

[HTML][HTML] Advancing ocr accuracy in image-to-latex conversion—a critical and creative exploration

EZ Orji, A Haydar, İ Erşan, OO Mwambe - Applied Sciences, 2023 - mdpi.com
This paper comprehensively assesses the application of active learning strategies to
enhance natural language processing-based optical character recognition (OCR) models for …

Exploring AI-driven approaches for unstructured document analysis and future horizons

SV Mahadevkar, S Patil, K Kotecha, LW Soong… - Journal of Big Data, 2024 - Springer
In the current industrial landscape, a significant number of sectors are grappling with the
challenges posed by unstructured data, which incurs financial losses amounting to millions …

F2M: Ensemble-based uncertainty estimation model for fire detection in indoor environments

M Arlović, M Patel, J Balen, F Hržić - Engineering applications of artificial …, 2024 - Elsevier
Early fire detection and timely notification are paramount for preventing human and material
casualties caused by fire. As a result, scientists have developed various fire monitoring …

Applicability of ocr engines for text recognition in vehicle number plates, receipts and handwriting

U Poudel, AM Regmi, Z Stamenkovic… - Journal of Circuits …, 2023 - World Scientific
Optical character recognition (OCR) is a computer vision technique that enables computers
to recognize text from images. Text detection and computer vision have made significant …

[HTML][HTML] Region Segmentation of Images Based on a Raster-Scan Paradigm

L Lukač, A Nerat, D Strnad, Š Horvat… - Journal of Sensor and …, 2024 - mdpi.com
This paper introduces a new method for the region segmentation of images. The approach is
based on the raster-scan paradigm and builds the segments incrementally. The pixels are …

Detection of redacted text in legal documents

R van Heusden, A de Ruijter, R Majoor… - … Conference on Theory …, 2023 - Springer
We present a technique for automatically detecting redacted text in legal documents, using a
combination of Optical Character Recognition (OCR) and morphological operations from the …

PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents

N Zhang, C Heaton, ST Okonsky, P Mitra… - arxiv preprint arxiv …, 2024 - arxiv.org
Optical Character Recognition (OCR) is an established task with the objective of identifying
the text present in an image. While many off-the-shelf OCR models exist, they are often …

[HTML][HTML] An efficient method for disaster tweets classification using gradient-based optimized convolutional neural networks with BERT embeddings

D Dharrao, MR Aadithyanarayanan, R Mital, A Vengali… - MethodsX, 2024 - Elsevier
Event of the disastrous scenarios are actively discussed on microblogging platforms like
Twitter which can lead to chaotic situations. In the era of machine learning and deep …