[HTML][HTML] Review of image classification algorithms based on convolutional neural networks
L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …
emergence of deep learning has promoted the development of this field. Convolutional …
Auto-encoders in deep learning—a review with new perspectives
S Chen, W Guo - Mathematics, 2023 - mdpi.com
Deep learning, which is a subfield of machine learning, has opened a new era for the
development of neural networks. The auto-encoder is a key component of deep structure …
development of neural networks. The auto-encoder is a key component of deep structure …
Seeing beyond the brain: Conditional diffusion model with sparse masked modeling for vision decoding
Decoding visual stimuli from brain recordings aims to deepen our understanding of the
human visual system and build a solid foundation for bridging human and computer vision …
human visual system and build a solid foundation for bridging human and computer vision …
[PDF][PDF] The computational limits of deep learning
Deep learning's recent history has been one of achievement: from triumphing over humans
in the game of Go to world-leading performance in image classification, voice recognition …
in the game of Go to world-leading performance in image classification, voice recognition …
Yolact: Real-time instance segmentation
We present a simple, fully-convolutional model for real-time instance segmentation that
achieves 29.8 mAP on MS COCO at 33.5 fps evaluated on a single Titan Xp, which is …
achieves 29.8 mAP on MS COCO at 33.5 fps evaluated on a single Titan Xp, which is …
Bayesian loss for crowd count estimation with point supervision
In crowd counting datasets, each person is annotated by a point, which is usually the center
of the head. And the task is to estimate the total count in a crowd scene. Most of the state-of …
of the head. And the task is to estimate the total count in a crowd scene. Most of the state-of …
Refining activation downsampling with SoftPool
Abstract Convolutional Neural Networks (CNNs) use pooling to decrease the size of
activation maps. This process is crucial to increase the receptive fields and to reduce …
activation maps. This process is crucial to increase the receptive fields and to reduce …
Infrared-visible cross-modal person re-identification with an x modality
This paper focuses on the emerging Infrared-Visible cross-modal person re-identification
task (IV-ReID), which takes infrared images as input and matches with visible color images …
task (IV-ReID), which takes infrared images as input and matches with visible color images …
2D object recognition: a comparative analysis of SIFT, SURF and ORB feature descriptors
Object recognition is a key research area in the field of image processing and computer
vision, which recognizes the object in an image and provides a proper label. In the paper …
vision, which recognizes the object in an image and provides a proper label. In the paper …
Remote sensing image scene classification: Benchmark and state of the art
Remote sensing image scene classification plays an important role in a wide range of
applications and hence has been receiving remarkable attention. During the past years …
applications and hence has been receiving remarkable attention. During the past years …