Deep imbalanced learning for multimodal emotion recognition in conversations
The main task of multimodal emotion recognition in conversations (MERC) is to identify the
emotions in modalities, eg, text, audio, image, and video, which is a significant development …
emotions in modalities, eg, text, audio, image, and video, which is a significant development …
Adapting a swin transformer for license plate number and text detection in drone images
The use of drones and unmanned aerial vehicles has significantly increased in various real-
world applications such as monitoring illegal car parking, tracing vehicles, controlling traffic …
world applications such as monitoring illegal car parking, tracing vehicles, controlling traffic …
A decade: review of scene text detection methods
E Rainarli - Computer Science Review, 2021 - Elsevier
The rapid development of scene text detection shows us the need for text recognition in a
scene image. Road signs recognition, reading the scene image for machine translation, text …
scene image. Road signs recognition, reading the scene image for machine translation, text …
ACE: Anchor-free corner evolution for real-time arbitrarily-oriented object detection
Objects with different orientations are ubiquitous in the real world (eg, texts/hands in the
scene image, objects in the aerial image, etc.), and the widely-used axis-aligned bounding …
scene image, objects in the aerial image, etc.), and the widely-used axis-aligned bounding …
HGR-Net: Hierarchical graph reasoning network for arbitrary shape scene text detection
As a prerequisite step of scene text reading, scene text detection is known as a challenging
task due to natural scene text diversity and variability. Most existing methods either adopt …
task due to natural scene text diversity and variability. Most existing methods either adopt …
Textdct: Arbitrary-shaped text detection via discrete cosine transform mask
Arbitrary-shaped scene text detection is a challenging task due to the variety of text changes
in font, size, color, and orientation. Most existing regression based methods resort to regress …
in font, size, color, and orientation. Most existing regression based methods resort to regress …
Text growing on leaf
Irregular-shaped texts bring challenges to Scene Text Detection (STD). Although existing
regression-based approaches achieve comparable performances, they fail to cover some …
regression-based approaches achieve comparable performances, they fail to cover some …
An end-to-end model for multi-view scene text recognition
Due to the increasing applications of surveillance and monitoring such as person re-
identification, vehicle re-identification and sports events tracking, the necessity of text …
identification, vehicle re-identification and sports events tracking, the necessity of text …
Morphtext: Deep morphology regularized accurate arbitrary-shape scene text detection
Bottom-up text detection methods play an important role in arbitrary-shape scene text
detection but there are two restrictions preventing them from achieving their great potential …
detection but there are two restrictions preventing them from achieving their great potential …
Granularity-aware single-point scene text spotting with sequential recurrence self-attention
Scene text spotting, a unified framework between text detection and text recognition, has
made great progress in recent years. Existing methods usually adopt the fully-supervised …
made great progress in recent years. Existing methods usually adopt the fully-supervised …