Transformer-based text detection in the wild

Z Raisi, MA Naiel, G Younes… - Proceedings of the …, 2021 - openaccess.thecvf.com
A major limitation to most state-of-the-art visual localization methods is their ineptitude to
make use of ubiquitous signs and directions that are typically intuitive to humans …

DanXe: An extended artificial intelligence framework to analyze and promote dance heritage

L Stacchio, S Garzarella, P Cascarano… - Digital Applications in …, 2024 - Elsevier
Motivated by the need to leverage technologies to enhance the preservation, accessibility,
quantitative data analysis, and valorization of Dance Heritage, this work introduces DanXe …

R-YOLO: A real-time text detector for natural scenes with arbitrary rotation

X Wang, S Zheng, C Zhang, R Li, L Gui - Sensors, 2021 - mdpi.com
Accurate and efficient text detection in natural scenes is a fundamental yet challenging task
in computer vision, especially when dealing with arbitrarily-oriented texts. Most …

Deep Learning Techniques for Detecting and Segmenting Text in Natural Scene Images

A Hussein, MSM Altaei - Al-Nahrain Journal of Science, 2024 - iasj.net
Text detection and segmentation in natural scene images is an active research problem in
computer vision and document analysis. Unlike scanned documents, scene text exhibits …

Arbitrary shape text detection using transformers

Z Raisi, G Younes, J Zelek - 2022 26th International …, 2022 - ieeexplore.ieee.org
Recent text detection frameworks require several handcrafted components such as anchor
generation, non-maximum suppression (NMS), or multiple processing stages (e.g. label …

2LSPE: 2D learnable sinusoidal positional encoding using transformer for scene text recognition

Z Raisi, MA Naiel, G Younes… - 2021 18th Conference …, 2021 - ieeexplore.ieee.org
Positional Encoding (PE) plays a vital role in a Transformer's ability to capture the order of
sequential information, allowing it to overcome the permutation equivariance property …

Multimodal transformer for comics text-cloze

E Vivoli, J Lafuente Baeza, E Valveny Llobet… - … on Document Analysis …, 2024 - Springer
This work explores a closure task in comics, a medium where visual and textual elements
are intricately intertwined. Specifically, text-cloze refers to the task of selecting the correct …

DC-PSENet: a novel scene text detection method integrating double ResNet-based and changed channels recursive feature pyramid

L Huang, S Liao, W Yang - The Visual Computer, 2024 - Springer
Due to the emergence and advancement of deep learning technologies, scene text
detection is becoming more widespread in various fields. However, due to the complexity of …

Occluded text detection and recognition in the wild

Z Raisi, J Zelek - 2022 19th Conference on Robots and Vision …, 2022 - ieeexplore.ieee.org
The performance of existing deep-learning scene text recognition-based methods fails
significantly on occluded text instances or even partially occluded characters in a text due to …

Recent Trends in Deep Learning-Based Optical Character Recognition

G Min, A Lee, KS Kim, JE Kim, HS Kang… - Electronics and …, 2022 - koreascience.kr
Optical character recognition is a primary technology required in different fields, including
digitizing archival documents, industrial automation, automatic driving, video analytics …