Deep learning-based detection from the perspective of small or tiny objects: A survey

K Tong, Y Wu - Image and Vision Computing, 2022 - Elsevier
Detecting small or tiny objects is always a difficult and challenging issue in computer vision.
In this paper, we provide a latest and comprehensive survey of deep learning-based …

Object detection using deep learning, CNNs and vision transformers: A review

AB Amjoud, M Amrouch - IEEE Access, 2023 - ieeexplore.ieee.org
Detecting objects remains one of computer vision and image understanding applications'
most fundamental and challenging aspects. Significant advances in object detection have …

YOLOv6: A single-stage object detection framework for industrial applications

C Li, L Li, H Jiang, K Weng, Y Geng, L Li, Z Ke… - arxiv preprint arxiv …, 2022 - arxiv.org
For years, the YOLO series has been the de facto industry-level standard for efficient object
detection. The YOLO community has prospered overwhelmingly to enrich its use in a …

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

CY Wang, A Bochkovskiy… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Real-time object detection is one of the most important research topics in computer vision.
As new approaches regarding architecture optimization and training optimization are …

Mixformer: End-to-end tracking with iterative mixed attention

Y Cui, C Jiang, L Wang, G Wu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Tracking often uses a multi-stage pipeline of feature extraction, target information
integration, and bounding box estimation. To simplify this pipeline and unify the process of …

Tood: Task-aligned one-stage object detection

C Feng, Y Zhong, Y Gao, MR Scott… - 2021 IEEE/CVF …, 2021 - computer.org
One-stage object detection is commonly implemented by optimizing two sub-tasks: object
classification and localization, using heads with two parallel branches, which might lead to a …

Dense distinct query for end-to-end object detection

S Zhang, X Wang, J Wang, J Pang… - Proceedings of the …, 2023 - openaccess.thecvf.com
One-to-one label assignment in object detection has successfully obviated the need of non-
maximum suppression (NMS) as a postprocessing and makes the pipeline end-to-end …

Pyramid vision transformer: A versatile backbone for dense prediction without convolutions

W Wang, E **e, X Li, DP Fan, K Song… - Proceedings of the …, 2021 - openaccess.thecvf.com
Although convolutional neural networks (CNNs) have achieved great success in computer
vision, this work investigates a simpler, convolution-free backbone network useful for many …

RetinaNet with difference channel attention and adaptively spatial feature fusion for steel surface defect detection

X Cheng, J Yu - IEEE Transactions on Instrumentation and …, 2020 - ieeexplore.ieee.org
Surface defect detection of products is an important process to guarantee the quality of
industrial production. A defect detection task aims to identify the specific category and …

YOLO-MS: rethinking multi-scale representation learning for real-time object detection

Y Chen, X Yuan, J Wang, R Wu, X Li… - … on Pattern Analysis …, 2025 - ieeexplore.ieee.org
We aim at providing the object detection community with an efficient and performant object
detector, termed YOLO-MS. The core design is based on a series of investigations on how …