[HTML][HTML] Review of image classification algorithms based on convolutional neural networks

L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …

Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Scaling up your kernels to 31x31: Revisiting large kernel design in cnns

X Ding, X Zhang, J Han, G Ding - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by
recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few …

Informer: Beyond efficient transformer for long sequence time-series forecasting

H Zhou, S Zhang, J Peng, S Zhang, J Li… - Proceedings of the …, 2021 - ojs.aaai.org
Many real-world applications require the prediction of long sequence time-series, such as
electricity consumption planning. Long sequence time-series forecasting (LSTF) demands a …

Contrastive learning for unpaired image-to-image translation

T Park, AA Efros, R Zhang, JY Zhu - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
In image-to-image translation, each patch in the output should reflect the content of the
corresponding patch in the input, independent of domain. We propose a straightforward …

Deep high-resolution representation learning for visual recognition

J Wang, K Sun, T Cheng, B Jiang… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
High-resolution representations are essential for position-sensitive vision problems, such as
human pose estimation, semantic segmentation, and object detection. Existing state-of-the …

Conformer: Local features coupling global representations for visual recognition

Z Peng, W Huang, S Gu, L **e… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Within Convolutional Neural Network (CNN), the convolution operations are good
at extracting local features but experience difficulty to capture global representations. Within …

Generative modeling by estimating gradients of the data distribution

Y Song, S Ermon - Advances in neural information …, 2019 - proceedings.neurips.cc
We introduce a new generative model where samples are produced via Langevin dynamics
using gradients of the data distribution estimated with score matching. Because gradients …

Semantic image synthesis with spatially-adaptive normalization

T Park, MY Liu, TC Wang… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
We propose spatially-adaptive normalization, a simple but effective layer for synthesizing
photorealistic images given an input semantic layout. Previous methods directly feed the …

Selective kernel networks

X Li, W Wang, X Hu, J Yang - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Abstract In standard Convolutional Neural Networks (CNNs), the receptive fields of artificial
neurons in each layer are designed to share the same size. It is well-known in the …