A two-stage road sign detection and text recognition system based on YOLOv7

CC Hsieh, CH Hsu, WH Huang - Internet of Things, 2024 - Elsevier
We developed a two-stage traffic sign recognition system to enhance safety and prevent
tragic traffic incidents involving self-driving cars. In the first stage, YOLOv7 was employed as …

Quantitative Evaluation of the Pore and Window Sizes of Tissue Engineering Scaffolds on Scanning Electron Microscope Images Using Deep Learning

I Karaca, B Aldemir Dikici - ACS Omega, 2024 - ACS Publications
The morphological characteristics of tissue engineering scaffolds, such as pore and window
diameters, are crucial, as they directly impact cell-material interactions, attachment …

Automated License Plate Detection and Recognition using YOLOv8 and OCR With Tello Drone Camera

H Fakhrurroja, D Pramesti… - … , Informatics and its …, 2023 - ieeexplore.ieee.org
This paper proposes an Automated License Plate Recognition (ALPR) system using
YOLOv8 and Optical Character Recognition (OCR). This system is developed with the …

Wagon and Container Codes Detection and Recognition based on YOLOv8

A Diaz-Diaz, F Parrilla, R De La Iglesia… - 2024 7th Iberian …, 2024 - ieeexplore.ieee.org
Railway systems have become an essential and growing element for freight transport sector
due to the low levels of pollution it offers compared to other alternatives. However, they have …

Real-Time Multi-class Helmet Violation Detection Using YOLOv8 with License Plate Recognition

A Charef, Z Jarir, M Quafafou - International Conference on Digital …, 2024 - Springer
In this paper, we propose an end-to-end approach to performing multi-class, real-time
detection, prior to the specific classes of motorcycles, helmets, and license plates in traffic …

Leveraging Transformer-Based OCR Model with Generative Data Augmentation for Engineering Document Recognition

W Khallouli, MS Uddin, A Sousa-Poza, J Li, S Kovacic - Electronics, 2024 - mdpi.com
The long-standing practice of document-based engineering has resulted in the
accumulation of a large number of engineering documents across various industries …

Advancing Vehicle Plate Recognition: Multitasking Visual Language Models with VehiclePaliGemma

N AlDahoul, MJT Tan, RR Tera, HA Karim… - arXiv preprint arXiv …, 2024 - arxiv.org
License plate recognition (LPR) involves automated systems that utilize cameras and
computer vision to read vehicle license plates. Such plates collected through LPR can then …

Towards LLMCI-Multimodal AI for LLM-Vision UI Operation

H Barham, M Fasha - 2024 - researchsquare.com
Human-computer interaction (HCI) has evolved significantly, yet it still largely depends on
visual communication through screens and manual input devices. While this paradigm is …

AI-based Multimodal Resume Ranking Web Application for Large Scale Job Recruitment

MB Yazıcı, D Sabaz, W Elmasry - 2024 8th International Artificial …, 2024 - ieeexplore.ieee.org
This paper presents a resume-ranking web application that improves recruitment through
advanced deep-learning techniques. The system uses the YOLOv9 model fine-tuned with …