Object detection using deep learning, CNNs and vision transformers: A review

AB Amjoud, M Amrouch - IEEE Access, 2023 - ieeexplore.ieee.org
Detecting objects remains one of computer vision and image understanding applications'
most fundamental and challenging aspects. Significant advances in object detection have …

Tiny object detection with context enhancement and feature purification

J **ao, H Guo, J Zhou, T Zhao, Q Yu, Y Chen… - Expert Systems with …, 2023 - Elsevier
Tiny object detection is one of the challenges in the field of object detection, which can be
applied in a variety of fields. Thanks to the advances in deep learning, significant …

Yolov4: Optimal speed and accuracy of object detection

A Bochkovskiy, CY Wang, HYM Liao - arxiv preprint arxiv:2004.10934, 2020 - arxiv.org
There are a huge number of features which are said to improve Convolutional Neural
Network (CNN) accuracy. Practical testing of combinations of such features on large …

CSPNet: A new backbone that can enhance learning capability of CNN

CY Wang, HYM Liao, YH Wu… - Proceedings of the …, 2020 - openaccess.thecvf.com
Neural networks have enabled state-of-the-art approaches to achieve incredible results on
computer vision tasks such as object detection. However, such success greatly relies on …

Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection

S Zhang, C Chi, Y Yao, Z Lei… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Object detection has been dominated by anchor-based detectors for several years.
Recently, anchor-free detectors have become popular due to the proposal of FPN and Focal …

Sipmask: Spatial information preservation for fast image and video instance segmentation

J Cao, RM Anwer, H Cholakkal, FS Khan… - Computer Vision–ECCV …, 2020 - Springer
Single-stage instance segmentation approaches have recently gained popularity due to
their speed and simplicity, but are still lagging behind in accuracy, compared to two-stage …

Good visual guidance makes a better extractor: Hierarchical visual prefix for multimodal entity and relation extraction

X Chen, N Zhang, L Li, Y Yao, S Deng, C Tan… - arxiv preprint arxiv …, 2022 - arxiv.org
Multimodal named entity recognition and relation extraction (MNER and MRE) is a
fundamental and crucial branch in information extraction. However, existing approaches for …

Learning human-object interaction detection using interaction points

T Wang, T Yang, M Danelljan… - Proceedings of the …, 2020 - openaccess.thecvf.com
Understanding interactions between humans and objects is one of the fundamental
problems in visual classification and an essential step towards detailed scene …

D2det: Towards high quality object detection and instance segmentation

J Cao, H Cholakkal, RM Anwer… - Proceedings of the …, 2020 - openaccess.thecvf.com
We propose a novel two-stage detection method, D2Det, that collectively addresses both
precise localization and accurate classification. For precise localization, we introduce a …

Concrete crack detection using lightweight attention feature fusion single shot multibox detector

W Zhu, H Zhang, J Eastwood, X Qi, J Jia… - Knowledge-Based Systems, 2023 - Elsevier
As one of the most important defects of concrete, cracks seriously threaten the service life
and safety of concrete structures, and various safety incidents caused by the collapse of …