A comprehensive review of object detection with deep learning

R Kaur, S Singh - Digital Signal Processing, 2023 - Elsevier
In the realm of computer vision, Deep Convolutional Neural Networks (DCNNs) have
demonstrated excellent performance. Video Processing, Object Detection, Image …

Tools, techniques, datasets and application areas for object detection in an image: a review

J Kaur, W Singh - Multimedia Tools and Applications, 2022 - Springer
Object detection is one of the most fundamental and challenging tasks to locate objects in
images and videos. Over the past, it has gained much attention to do more research on …

SEED-Bench: Benchmarking Multimodal Large Language Models

B Li, Y Ge, Y Ge, G Wang, R Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multimodal large language models (MLLMs) building upon the foundation of powerful large
language models (LLMs) have recently demonstrated exceptional capabilities in generating …

What is wrong with scene text recognition model comparisons? dataset and model analysis

J Baek, G Kim, J Lee, S Park, D Han… - Proceedings of the …, 2019 - openaccess.thecvf.com
Many new proposals for scene text recognition (STR) models have been introduced in
recent years. While each claim to have pushed the boundary of the technology, a holistic …

Mask textspotter: An end-to-end trainable neural network for spotting text with arbitrary shapes

P Lyu, M Liao, C Yao, W Wu… - Proceedings of the …, 2018 - openaccess.thecvf.com
Recently, models based on deep neural networks have dominated the fields of scene text
detection and recognition. In this paper, we investigate the problem of scene text spotting …

Aster: An attentional scene text recognizer with flexible rectification

B Shi, M Yang, X Wang, P Lyu, C Yao… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
A challenging aspect of scene text recognition is to handle text with distortions or irregular
layout. In particular, perspective text and curved text are common in natural scenes and are …

Moran: A multi-object rectified attention network for scene text recognition

C Luo, L **, Z Sun - Pattern Recognition, 2019 - Elsevier
Irregular text is widely used. However, it is considerably difficult to recognize because of its
various shapes and distorted patterns. In this paper, we thus propose a multi-object rectified …

Arbitrary-oriented scene text detection via rotation proposals

J Ma, W Shao, H Ye, L Wang, H Wang… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
This paper introduces a novel rotation-based framework for arbitrary-oriented text detection
in natural scene images. We present the Rotation Region Proposal Networks, which are …

Decoupled attention network for text recognition

T Wang, Y Zhu, L **, C Luo, X Chen, Y Wu… - Proceedings of the …, 2020 - ojs.aaai.org
Text recognition has attracted considerable research interests because of its various
applications. The cutting-edge text recognition methods are based on attention mechanisms …

Vision transformer for fast and efficient scene text recognition

R Atienza - International conference on document analysis and …, 2021 - Springer
Scene text recognition (STR) enables computers to read text in natural scenes such as
object labels, road signs and instructions. STR helps machines perform informed decisions …