Text recognition in the wild: A survey

X Chen, L **, Y Zhu, C Luo, T Wang - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …

A comprehensive study on deep learning-based methods for sign language recognition

N Adaloglou, T Chatzis, I Papastratis… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
In this paper, a comparative experimental assessment of computer vision-based methods for
sign language recognition is conducted. By implementing the most recent deep neural …

LISTER: neighbor decoding for length-insensitive scene text recognition

C Cheng, P Wang, C Da, Q Zheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
The diversity in length constitutes a significant characteristic of text. Due to the long-tail
distribution of text lengths, most existing methods for scene text recognition (STR) only work …

Distilling cross-temporal contexts for continuous sign language recognition

L Guo, W Xue, Q Guo, B Liu, K Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Continuous sign language recognition (CSLR) aims to recognize glosses in a sign language
video. State-of-the-art methods typically have two modules, a spatial perception module and …

An attention-based convolutional recurrent neural networks for scene text recognition

AAA Alshawi, J Tanha, MA Balafar - IEEE Access, 2024 - ieeexplore.ieee.org
Text recognition is critical in various domains, including driving assistance, handwriting
recognition, and aiding the visually impaired. In recent years, deep learning-based methods …

RealTranS: End-to-end simultaneous speech translation with convolutional weighted-shrinking transformer

X Zeng, L Li, Q Liu - arxiv preprint arxiv:2106.04833, 2021 - arxiv.org
End-to-end simultaneous speech translation (SST), which directly translates speech in one
language into text in another language in real-time, is useful in many scenarios but has not …

Self-distillation regularized connectionist temporal classification loss for text recognition: A simple yet effective approach

Z Zhang, N Lu, M Liao, Y Huang, C Li… - Proceedings of the …, 2024 - ojs.aaai.org
Text recognition methods are gaining rapid development. Some advanced techniques, eg,
powerful modules, language models, and un-and semi-supervised learning schemes …

Deep radial embedding for visual sequence learning

Y Min, P Jiao, Y Li, X Wang, L Lei, X Chai… - European conference on …, 2022 - Springer
Abstract Connectionist Temporal Classification (CTC) is a popular objective function in
sequence recognition, which provides supervision for unsegmented sequence data through …

Less peaky and more accurate CTC forced alignment by label priors

R Huang, X Zhang, Z Ni, L Sun, M Hira… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Connectionist temporal classification (CTC) models are known to have peaky output
distributions. Such behavior is not a problem for automatic speech recognition (ASR), but it …

Rfwash: a weakly supervised tracking of hand hygiene technique

A Khamis, B Kusy, CT Chou, ML McLaws… - Proceedings of the 18th …, 2020 - dl.acm.org
Each year, hundreds of thousands of people contract Healthcare Associated Infections
(HAIs). Poor hand hygiene compliance among healthcare workers is thought to be the …