Object detection using deep learning, CNNs and vision transformers: A review
Detecting objects remains one of computer vision and image understanding applications'
most fundamental and challenging aspects. Significant advances in object detection have …
most fundamental and challenging aspects. Significant advances in object detection have …
Tiny object detection with context enhancement and feature purification
Tiny object detection is one of the challenges in the field of object detection, which can be
applied in a variety of fields. Thanks to the advances in deep learning, significant …
applied in a variety of fields. Thanks to the advances in deep learning, significant …
Yolov4: Optimal speed and accuracy of object detection
There are a huge number of features which are said to improve Convolutional Neural
Network (CNN) accuracy. Practical testing of combinations of such features on large …
Network (CNN) accuracy. Practical testing of combinations of such features on large …
CSPNet: A new backbone that can enhance learning capability of CNN
Neural networks have enabled state-of-the-art approaches to achieve incredible results on
computer vision tasks such as object detection. However, such success greatly relies on …
computer vision tasks such as object detection. However, such success greatly relies on …
Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection
Object detection has been dominated by anchor-based detectors for several years.
Recently, anchor-free detectors have become popular due to the proposal of FPN and Focal …
Recently, anchor-free detectors have become popular due to the proposal of FPN and Focal …
Sipmask: Spatial information preservation for fast image and video instance segmentation
Single-stage instance segmentation approaches have recently gained popularity due to
their speed and simplicity, but are still lagging behind in accuracy, compared to two-stage …
their speed and simplicity, but are still lagging behind in accuracy, compared to two-stage …
Good visual guidance makes a better extractor: Hierarchical visual prefix for multimodal entity and relation extraction
Multimodal named entity recognition and relation extraction (MNER and MRE) is a
fundamental and crucial branch in information extraction. However, existing approaches for …
fundamental and crucial branch in information extraction. However, existing approaches for …
Learning human-object interaction detection using interaction points
Understanding interactions between humans and objects is one of the fundamental
problems in visual classification and an essential step towards detailed scene …
problems in visual classification and an essential step towards detailed scene …
D2det: Towards high quality object detection and instance segmentation
We propose a novel two-stage detection method, D2Det, that collectively addresses both
precise localization and accurate classification. For precise localization, we introduce a …
precise localization and accurate classification. For precise localization, we introduce a …
Concrete crack detection using lightweight attention feature fusion single shot multibox detector
As one of the most important defects of concrete, cracks seriously threaten the service life
and safety of concrete structures, and various safety incidents caused by the collapse of …
and safety of concrete structures, and various safety incidents caused by the collapse of …