Avoiding overfitting: A survey on regularization methods for convolutional neural networks

CFGD Santos, JP Papa - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
Several image processing tasks, such as image classification and object detection, have
been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and …

A survey of deep active learning

P Ren, Y **ao, X Chang, PY Huang, Z Li… - ACM computing …, 2021 - dl.acm.org
Active learning (AL) attempts to maximize a model's performance gain while annotating the
fewest samples possible. Deep learning (DL) is greedy for data and requires a large amount …

Detrs beat yolos on real-time object detection

Y Zhao, W Lv, S Xu, J Wei, G Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
The YOLO series has become the most popular framework for real-time object detection due
to its reasonable trade-off between speed and accuracy. However we observe that the …

Segnext: Rethinking convolutional attention design for semantic segmentation

MH Guo, CZ Lu, Q Hou, Z Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc
We present SegNeXt, a simple convolutional network architecture for semantic
segmentation. Recent transformer-based models have dominated the field of se-mantic …

Vision-language models for vision tasks: A survey

J Zhang, J Huang, S **, S Lu - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Most visual recognition studies rely heavily on crowd-labelled data in deep neural networks
(DNNs) training, and they usually train a DNN for each single visual recognition task …

Rtmdet: An empirical study of designing real-time object detectors

C Lyu, W Zhang, H Huang, Y Zhou, Y Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
In this paper, we aim to design an efficient real-time object detector that exceeds the YOLO
series and is easily extensible for many object recognition tasks such as instance …

Adaface: Quality adaptive margin for face recognition

M Kim, AK Jain, X Liu - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Recognition in low quality face datasets is challenging because facial attributes are
obscured and degraded. Advances in margin-based loss functions have resulted in …

Curricular contrastive regularization for physics-aware single image dehazing

Y Zheng, J Zhan, S He, J Dong… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Considering the ill-posed nature, contrastive regularization has been developed for single
image dehazing, introducing the information from negative images as a lower bound …

Vim: Out-of-distribution with virtual-logit matching

H Wang, Z Li, L Feng, W Zhang - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Most of the existing Out-Of-Distribution (OOD) detection algorithms depend on single input
source: the feature, the logit, or the softmax probability. However, the immense diversity of …

Understanding the robustness in vision transformers

D Zhou, Z Yu, E **e, C **ao… - International …, 2022 - proceedings.mlr.press
Recent studies show that Vision Transformers (ViTs) exhibit strong robustness against
various corruptions. Although this property is partly attributed to the self-attention …