[HTML][HTML] Review of image classification algorithms based on convolutional neural networks
L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …
emergence of deep learning has promoted the development of this field. Convolutional …
Recent advances and clinical applications of deep learning in medical image analysis
Deep learning has received extensive research interest in develo** new medical image
processing algorithms, and deep learning based models have been remarkably successful …
processing algorithms, and deep learning based models have been remarkably successful …
Vision mamba: Efficient visual representation learning with bidirectional state space model
Recently the state space models (SSMs) with efficient hardware-aware designs, ie, the
Mamba deep learning model, have shown great potential for long sequence modeling …
Mamba deep learning model, have shown great potential for long sequence modeling …
Cswin transformer: A general vision transformer backbone with cross-shaped windows
Abstract We present CSWin Transformer, an efficient and effective Transformer-based
backbone for general-purpose vision tasks. A challenging issue in Transformer design is …
backbone for general-purpose vision tasks. A challenging issue in Transformer design is …
Pyramid vision transformer: A versatile backbone for dense prediction without convolutions
Although convolutional neural networks (CNNs) have achieved great success in computer
vision, this work investigates a simpler, convolution-free backbone network useful for many …
vision, this work investigates a simpler, convolution-free backbone network useful for many …
Accuracy on the line: on the strong correlation between out-of-distribution and in-distribution generalization
For machine learning systems to be reliable, we must understand their performance in
unseen, out-of-distribution environments. In this paper, we empirically show that out-of …
unseen, out-of-distribution environments. In this paper, we empirically show that out-of …
Towards real-world blind face restoration with generative facial prior
Blind face restoration usually relies on facial priors, such as facial geometry prior or
reference prior, to restore realistic and faithful details. However, very low-quality inputs …
reference prior, to restore realistic and faithful details. However, very low-quality inputs …
Rethinking rotated object detection with gaussian wasserstein distance loss
Boundary discontinuity and its inconsistency to the final detection metric have been the
bottleneck for rotating detection regression loss design. In this paper, we propose a novel …
bottleneck for rotating detection regression loss design. In this paper, we propose a novel …
Volo: Vision outlooker for visual recognition
Recently, Vision Transformers (ViTs) have been broadly explored in visual recognition. With
low efficiency in encoding fine-level features, the performance of ViTs is still inferior to the …
low efficiency in encoding fine-level features, the performance of ViTs is still inferior to the …
[PDF][PDF] The computational limits of deep learning
Deep learning's recent history has been one of achievement: from triumphing over humans
in the game of Go to world-leading performance in image classification, voice recognition …
in the game of Go to world-leading performance in image classification, voice recognition …