[HTML][HTML] Review of image classification algorithms based on convolutional neural networks
L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …
emergence of deep learning has promoted the development of this field. Convolutional …
Multimodal image synthesis and editing: A survey and taxonomy
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …
among multimodal information plays a key role for the creation and perception of multimodal …
Scaling up your kernels to 31x31: Revisiting large kernel design in cnns
We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by
recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few …
recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few …
Informer: Beyond efficient transformer for long sequence time-series forecasting
Many real-world applications require the prediction of long sequence time-series, such as
electricity consumption planning. Long sequence time-series forecasting (LSTF) demands a …
electricity consumption planning. Long sequence time-series forecasting (LSTF) demands a …
Contrastive learning for unpaired image-to-image translation
In image-to-image translation, each patch in the output should reflect the content of the
corresponding patch in the input, independent of domain. We propose a straightforward …
corresponding patch in the input, independent of domain. We propose a straightforward …
Deep high-resolution representation learning for visual recognition
High-resolution representations are essential for position-sensitive vision problems, such as
human pose estimation, semantic segmentation, and object detection. Existing state-of-the …
human pose estimation, semantic segmentation, and object detection. Existing state-of-the …
Conformer: Local features coupling global representations for visual recognition
Abstract Within Convolutional Neural Network (CNN), the convolution operations are good
at extracting local features but experience difficulty to capture global representations. Within …
at extracting local features but experience difficulty to capture global representations. Within …
Generative modeling by estimating gradients of the data distribution
We introduce a new generative model where samples are produced via Langevin dynamics
using gradients of the data distribution estimated with score matching. Because gradients …
using gradients of the data distribution estimated with score matching. Because gradients …
Semantic image synthesis with spatially-adaptive normalization
We propose spatially-adaptive normalization, a simple but effective layer for synthesizing
photorealistic images given an input semantic layout. Previous methods directly feed the …
photorealistic images given an input semantic layout. Previous methods directly feed the …
Selective kernel networks
Abstract In standard Convolutional Neural Networks (CNNs), the receptive fields of artificial
neurons in each layer are designed to share the same size. It is well-known in the …
neurons in each layer are designed to share the same size. It is well-known in the …