[HTML][HTML] Review of image classification algorithms based on convolutional neural networks

L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …

Recent advances and clinical applications of deep learning in medical image analysis

X Chen, X Wang, K Zhang, KM Fung, TC Thai… - Medical image …, 2022 - Elsevier
Deep learning has received extensive research interest in develo** new medical image
processing algorithms, and deep learning based models have been remarkably successful …

Vision mamba: Efficient visual representation learning with bidirectional state space model

L Zhu, B Liao, Q Zhang, X Wang, W Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Recently the state space models (SSMs) with efficient hardware-aware designs, ie, the
Mamba deep learning model, have shown great potential for long sequence modeling …

Cswin transformer: A general vision transformer backbone with cross-shaped windows

X Dong, J Bao, D Chen, W Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We present CSWin Transformer, an efficient and effective Transformer-based
backbone for general-purpose vision tasks. A challenging issue in Transformer design is …

Pyramid vision transformer: A versatile backbone for dense prediction without convolutions

W Wang, E **e, X Li, DP Fan, K Song… - Proceedings of the …, 2021 - openaccess.thecvf.com
Although convolutional neural networks (CNNs) have achieved great success in computer
vision, this work investigates a simpler, convolution-free backbone network useful for many …

Accuracy on the line: on the strong correlation between out-of-distribution and in-distribution generalization

JP Miller, R Taori, A Raghunathan… - International …, 2021 - proceedings.mlr.press
For machine learning systems to be reliable, we must understand their performance in
unseen, out-of-distribution environments. In this paper, we empirically show that out-of …

Towards real-world blind face restoration with generative facial prior

X Wang, Y Li, H Zhang, Y Shan - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Blind face restoration usually relies on facial priors, such as facial geometry prior or
reference prior, to restore realistic and faithful details. However, very low-quality inputs …

Rethinking rotated object detection with gaussian wasserstein distance loss

X Yang, J Yan, Q Ming, W Wang… - … on machine learning, 2021 - proceedings.mlr.press
Boundary discontinuity and its inconsistency to the final detection metric have been the
bottleneck for rotating detection regression loss design. In this paper, we propose a novel …

Volo: Vision outlooker for visual recognition

L Yuan, Q Hou, Z Jiang, J Feng… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Recently, Vision Transformers (ViTs) have been broadly explored in visual recognition. With
low efficiency in encoding fine-level features, the performance of ViTs is still inferior to the …

[PDF][PDF] The computational limits of deep learning

NC Thompson, K Greenewald, K Lee… - arxiv preprint arxiv …, 2020 - assets.pubpub.org
Deep learning's recent history has been one of achievement: from triumphing over humans
in the game of Go to world-leading performance in image classification, voice recognition …