Object detection using YOLO: Challenges, architectural successors, datasets and applications

T Diwan, G Anirudh, JV Tembhurne - multimedia Tools and Applications, 2023 - Springer
Object detection is one of the predominant and challenging problems in computer vision.
Over the decade, with the expeditious evolution of deep learning, researchers have …

A comprehensive review of object detection with deep learning

R Kaur, S Singh - Digital Signal Processing, 2023 - Elsevier
In the realm of computer vision, Deep Convolutional Neural Networks (DCNNs) have
demonstrated excellent performance. Video Processing, Object Detection, Image …

Towards large-scale small object detection: Survey and benchmarks

G Cheng, X Yuan, X Yao, K Yan, Q Zeng… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
With the rise of deep convolutional neural networks, object detection has achieved
prominent advances in past years. However, such prosperity could not camouflage the …

Activating more pixels in image super-resolution transformer

X Chen, X Wang, J Zhou, Y Qiao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Transformer-based methods have shown impressive performance in low-level vision tasks,
such as image super-resolution. However, we find that these networks can only utilize a …

YOLOWeeds: A novel benchmark of YOLO object detectors for multi-class weed detection in cotton production systems

F Dang, D Chen, Y Lu, Z Li - Computers and Electronics in Agriculture, 2023 - Elsevier
Weeds are among the major threats to cotton production. Overreliance on herbicides for
weed control has accelerated the evolution of herbicide-resistance in weeds and caused …

Object detection using deep learning, CNNs and vision transformers: A review

AB Amjoud, M Amrouch - IEEE Access, 2023 - ieeexplore.ieee.org
Detecting objects remains one of computer vision and image understanding applications'
most fundamental and challenging aspects. Significant advances in object detection have …

Aligning bag of regions for open-vocabulary object detection

S Wu, W Zhang, S **, W Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Pre-trained vision-language models (VLMs) learn to align vision and language
representations on large-scale datasets, where each image-text pair usually contains a bag …

Yolov1 to v8: Unveiling each variant–a comprehensive review of yolo

M Hussain - IEEE access, 2024 - ieeexplore.ieee.org
This paper implements a systematic methodological approach to review the evolution of
YOLO variants. Each variant is dissected by examining its internal architectural composition …

Efficient long-range attention network for image super-resolution

X Zhang, H Zeng, S Guo, L Zhang - European conference on computer …, 2022 - Springer
Recently, transformer-based methods have demonstrated impressive results in various
vision tasks, including image super-resolution (SR), by exploiting the self-attention (SA) for …

Image data augmentation for deep learning: A survey

S Yang, W **ao, M Zhang, S Guo, J Zhao… - arxiv preprint arxiv …, 2022 - arxiv.org
Deep learning has achieved remarkable results in many computer vision tasks. Deep neural
networks typically rely on large amounts of training data to avoid overfitting. However …