A survey of OCR in Arabic language: applications, techniques, and challenges
Optical character recognition (OCR) is the process of extracting handwritten or printed text
from a scanned or printed image and converting it to a machine-readable form for further …
from a scanned or printed image and converting it to a machine-readable form for further …
Exploring AI-driven approaches for unstructured document analysis and future horizons
In the current industrial landscape, a significant number of sectors are grappling with the
challenges posed by unstructured data, which incurs financial losses amounting to millions …
challenges posed by unstructured data, which incurs financial losses amounting to millions …
Advancing OCR Accuracy in Image-to-LaTeX Conversion—A Critical and Creative Exploration
This paper comprehensively assesses the application of active learning strategies to
enhance natural language processing-based optical character recognition (OCR) models for …
enhance natural language processing-based optical character recognition (OCR) models for …
F2M: Ensemble-based uncertainty estimation model for fire detection in indoor environments
Early fire detection and timely notification are paramount for preventing human and material
casualties caused by fire. As a result, scientists have developed various fire monitoring …
casualties caused by fire. As a result, scientists have developed various fire monitoring …
[HTML][HTML] An efficient method for disaster tweets classification using gradient-based optimized convolutional neural networks with BERT embeddings
D Dharrao, MR Aadithyanarayanan, R Mital, A Vengali… - MethodsX, 2024 - Elsevier
Event of the disastrous scenarios are actively discussed on microblogging platforms like
Twitter which can lead to chaotic situations. In the era of machine learning and deep …
Twitter which can lead to chaotic situations. In the era of machine learning and deep …
A comparison of deep transfer learning backbone architecture techniques for printed text detection of different font styles from unstructured documents
Object detection methods based on deep learning have been used in a variety of sectors
including banking, healthcare, e-governance, and academia. In recent years, there has …
including banking, healthcare, e-governance, and academia. In recent years, there has …
[HTML][HTML] Region Segmentation of Images Based on a Raster-Scan Paradigm
This paper introduces a new method for the region segmentation of images. The approach is
based on the raster-scan paradigm and builds the segments incrementally. The pixels are …
based on the raster-scan paradigm and builds the segments incrementally. The pixels are …
PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents
Optical Character Recognition (OCR) is an established task with the objective of identifying
the text present in an image. While many off-the-shelf OCR models exist, they are often …
the text present in an image. While many off-the-shelf OCR models exist, they are often …
Sentiment Analysis of Beauty Product Reviews Using the IndoBERT Method and Naive Bayes Classification
HM Ramdhan, MD Purbolaksono… - … on Information and …, 2024 - ieeexplore.ieee.org
This paper presents an integrated approach to sentiment analysis of beauty product reviews
using the IndoBERT model combined with Naive Bayes classification, which specifically …
using the IndoBERT model combined with Naive Bayes classification, which specifically …
[PDF][PDF] Applicability of OCR Engines for Text Recognition in Vehicle Number Plates, Receipts and Handwriting.
U. Poudel et al. experiments conducted on five different image categories: vehicle number
plates, receipts, handwriting, symbols and plain text images. Evaluation metrics such as …
plates, receipts, handwriting, symbols and plain text images. Evaluation metrics such as …