Survey of optimization algorithms in modern neural networks
The main goal of machine learning is the creation of self-learning algorithms in many areas
of human activity. It allows a replacement of a person with artificial intelligence in seeking to …
of human activity. It allows a replacement of a person with artificial intelligence in seeking to …
Contrastive semi-supervised learning for underwater image restoration via reliable bank
Despite the remarkable achievement of recent underwater image restoration techniques, the
lack of labeled data has become a major hurdle for further progress. In this work, we …
lack of labeled data has become a major hurdle for further progress. In this work, we …
Rethinking spatial dimensions of vision transformers
Abstract Vision Transformer (ViT) extends the application range of transformers from
language processing to computer vision tasks as being an alternative architecture against …
language processing to computer vision tasks as being an alternative architecture against …
Adan: Adaptive nesterov momentum algorithm for faster optimizing deep models
In deep learning, different kinds of deep networks typically need different optimizers, which
have to be chosen after multiple trials, making the training process inefficient. To relieve this …
have to be chosen after multiple trials, making the training process inefficient. To relieve this …
Probabilistic embeddings for cross-modal retrieval
Cross-modal retrieval methods build a common representation space for samples from
multiple modalities, typically from the vision and the language domains. For images and …
multiple modalities, typically from the vision and the language domains. For images and …
Surrogate gap minimization improves sharpness-aware training
The recently proposed Sharpness-Aware Minimization (SAM) improves generalization by
minimizing a\textit {perturbed loss} defined as the maximum loss within a neighborhood in …
minimizing a\textit {perturbed loss} defined as the maximum loss within a neighborhood in …
SCViT: A spatial-channel feature preserving vision transformer for remote sensing image scene classification
P Lv, W Wu, Y Zhong, F Du… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Convolutional neural network (CNN)-based methods are widely used in remote sensing
image scene classification and can obtain excellent performances. However, the stacked …
image scene classification and can obtain excellent performances. However, the stacked …
Leveraging real talking faces via self-supervision for robust forgery detection
One of the most pressing challenges for the detection of face-manipulated videos is
generalising to forgery methods not seen during training while remaining effective under …
generalising to forgery methods not seen during training while remaining effective under …
Fault diagnosis for small samples based on attention mechanism
Aiming at the application of deep learning in fault diagnosis, mechanical rotating equipment
components are prone to failure under complex working environment, and the industrial big …
components are prone to failure under complex working environment, and the industrial big …
Large-scale differentially private BERT
In this work, we study the large-scale pretraining of BERT-Large with differentially private
SGD (DP-SGD). We show that combined with a careful implementation, scaling up the batch …
SGD (DP-SGD). We show that combined with a careful implementation, scaling up the batch …