Survey of optimization algorithms in modern neural networks

R Abdulkadirov, P Lyakhov, N Nagornov - Mathematics, 2023 - mdpi.com
The main goal of machine learning is the creation of self-learning algorithms in many areas
of human activity. It allows a replacement of a person with artificial intelligence in seeking to …

Contrastive semi-supervised learning for underwater image restoration via reliable bank

S Huang, K Wang, H Liu, J Chen… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Despite the remarkable achievement of recent underwater image restoration techniques, the
lack of labeled data has become a major hurdle for further progress. In this work, we …

Rethinking spatial dimensions of vision transformers

B Heo, S Yun, D Han, S Chun… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Vision Transformer (ViT) extends the application range of transformers from
language processing to computer vision tasks as being an alternative architecture against …

Adan: Adaptive nesterov momentum algorithm for faster optimizing deep models

X **e, P Zhou, H Li, Z Lin, S Yan - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
In deep learning, different kinds of deep networks typically need different optimizers, which
have to be chosen after multiple trials, making the training process inefficient. To relieve this …

Probabilistic embeddings for cross-modal retrieval

S Chun, SJ Oh, RS De Rezende… - Proceedings of the …, 2021 - openaccess.thecvf.com
Cross-modal retrieval methods build a common representation space for samples from
multiple modalities, typically from the vision and the language domains. For images and …

Surrogate gap minimization improves sharpness-aware training

J Zhuang, B Gong, L Yuan, Y Cui, H Adam… - arxiv preprint arxiv …, 2022 - arxiv.org
The recently proposed Sharpness-Aware Minimization (SAM) improves generalization by
minimizing a\textit {perturbed loss} defined as the maximum loss within a neighborhood in …

SCViT: A spatial-channel feature preserving vision transformer for remote sensing image scene classification

P Lv, W Wu, Y Zhong, F Du… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Convolutional neural network (CNN)-based methods are widely used in remote sensing
image scene classification and can obtain excellent performances. However, the stacked …

Leveraging real talking faces via self-supervision for robust forgery detection

A Haliassos, R Mira, S Petridis… - Proceedings of the …, 2022 - openaccess.thecvf.com
One of the most pressing challenges for the detection of face-manipulated videos is
generalising to forgery methods not seen during training while remaining effective under …

Fault diagnosis for small samples based on attention mechanism

X Zhang, C He, Y Lu, B Chen, L Zhu, L Zhang - Measurement, 2022 - Elsevier
Aiming at the application of deep learning in fault diagnosis, mechanical rotating equipment
components are prone to failure under complex working environment, and the industrial big …

Large-scale differentially private BERT

R Anil, B Ghazi, V Gupta, R Kumar… - arxiv preprint arxiv …, 2021 - arxiv.org
In this work, we study the large-scale pretraining of BERT-Large with differentially private
SGD (DP-SGD). We show that combined with a careful implementation, scaling up the batch …