Mpdiou: a loss for efficient and accurate bounding box regression
S Ma, Y Xu - arxiv preprint arxiv:2307.07662, 2023 - arxiv.org
Bounding box regression (BBR) has been widely used in object detection and instance
segmentation, which is an important step in object localization. However, most of the existing …
segmentation, which is an important step in object localization. However, most of the existing …
Pyramid vision transformer: A versatile backbone for dense prediction without convolutions
Although convolutional neural networks (CNNs) have achieved great success in computer
vision, this work investigates a simpler, convolution-free backbone network useful for many …
vision, this work investigates a simpler, convolution-free backbone network useful for many …
Deepsolo: Let transformer decoder with explicit points solo for text spotting
End-to-end text spotting aims to integrate scene text detection and recognition into a unified
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …
Swintextspotter: Scene text spotting via better synergy between text detection and text recognition
End-to-end scene text spotting has attracted great attention in recent years due to the
success of excavating the intrinsic synergy of the scene text detection and recognition …
success of excavating the intrinsic synergy of the scene text detection and recognition …
Estextspotter: Towards better scene text spotting with explicit synergy in transformer
In recent years, end-to-end scene text spotting approaches are evolving to the Transformer-
based framework. While previous studies have shown the crucial importance of the intrinsic …
based framework. While previous studies have shown the crucial importance of the intrinsic …
Adapting a swin transformer for license plate number and text detection in drone images
The use of drones and unmanned aerial vehicles has significantly increased in various real-
world applications such as monitoring illegal car parking, tracing vehicles, controlling traffic …
world applications such as monitoring illegal car parking, tracing vehicles, controlling traffic …
Abinet++: Autonomous, bidirectional and iterative language modeling for scene text spotting
Scene text spotting is of great importance to the computer vision community due to its wide
variety of applications. Recent methods attempt to introduce linguistic knowledge for …
variety of applications. Recent methods attempt to introduce linguistic knowledge for …
On the arbitrary-oriented object detection: Classification based approaches revisited
Arbitrary-oriented object detection has been a building block for rotation sensitive tasks. We
first show that the boundary problem suffered in existing dominant regression-based rotation …
first show that the boundary problem suffered in existing dominant regression-based rotation …
Spts v2: single-point scene text spotting
End-to-end scene text spotting has made significant progress due to its intrinsic synergy
between text detection and recognition. Previous methods commonly regard manual …
between text detection and recognition. Previous methods commonly regard manual …
Weakly supervised scene text generation for low-resource languages
A large number of annotated training images is crucial for training successful scene text
recognition models. However, collecting sufficient datasets can be a labor-intensive and …
recognition models. However, collecting sufficient datasets can be a labor-intensive and …