YOLO-v1 to YOLO-v8, the rise of YOLO and its complementary nature toward digital manufacturing and industrial defect detection
M Hussain - Machines, 2023 - mdpi.com
Since its inception in 2015, the YOLO (You Only Look Once) variant of object detectors has
rapidly grown, with the latest release of YOLO-v8 in January 2023. YOLO variants are …
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …
DaViT: Dual attention vision transformers
In this work, we introduce Dual Attention Vision Transformers (DaViT), a simple yet effective
vision transformer architecture that is able to capture global context while maintaining …
Twins: Revisiting the design of spatial attention in vision transformers
Very recently, a variety of vision transformer architectures for dense prediction tasks have
been proposed and they show that the design of spatial attention is critical to their success in …
LocalViT: Bringing locality to vision transformers
We study how to introduce locality mechanisms into vision transformers. The transformer
network originates from machine translation and is particularly good at modelling long-range …
Transformers in vision: A survey
Astounding results from Transformer models on natural language tasks have intrigued the
vision community to study their application to computer vision problems. Among their salient …
A survey on vision transformer
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …
TransGAN: Two pure transformers can make one strong GAN, and that can scale up
The recent explosive interest on transformers has suggested their potential to become
powerful "universal" models for computer vision tasks, such as classification, detection, and …
Autoformer: Searching transformers for visual recognition
Recently, pure transformer-based models have shown great potentials for vision tasks such
as image classification and detection. However, the design of transformer networks is …
A survey on visual transformer
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …