[HTML][HTML] Review of image classification algorithms based on convolutional neural networks

L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …

Auto-encoders in deep learning—a review with new perspectives

S Chen, W Guo - Mathematics, 2023 - mdpi.com
Deep learning, which is a subfield of machine learning, has opened a new era for the
development of neural networks. The auto-encoder is a key component of deep structure …

Seeing beyond the brain: Conditional diffusion model with sparse masked modeling for vision decoding

Z Chen, J Qing, T **ang, WL Yue… - Proceedings of the …, 2023 - openaccess.thecvf.com
Decoding visual stimuli from brain recordings aims to deepen our understanding of the
human visual system and build a solid foundation for bridging human and computer vision …

[PDF][PDF] The computational limits of deep learning

NC Thompson, K Greenewald, K Lee… - arxiv preprint arxiv …, 2020 - assets.pubpub.org
Deep learning's recent history has been one of achievement: from triumphing over humans
in the game of Go to world-leading performance in image classification, voice recognition …

Yolact: Real-time instance segmentation

D Bolya, C Zhou, F **ao, YJ Lee - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
We present a simple, fully-convolutional model for real-time instance segmentation that
achieves 29.8 mAP on MS COCO at 33.5 fps evaluated on a single Titan Xp, which is …

Bayesian loss for crowd count estimation with point supervision

Z Ma, X Wei, X Hong, Y Gong - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
In crowd counting datasets, each person is annotated by a point, which is usually the center
of the head. And the task is to estimate the total count in a crowd scene. Most of the state-of …

Refining activation downsampling with SoftPool

A Stergiou, R Poppe… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Abstract Convolutional Neural Networks (CNNs) use pooling to decrease the size of
activation maps. This process is crucial to increase the receptive fields and to reduce …

Infrared-visible cross-modal person re-identification with an x modality

D Li, X Wei, X Hong, Y Gong - Proceedings of the AAAI conference on …, 2020 - ojs.aaai.org
This paper focuses on the emerging Infrared-Visible cross-modal person re-identification
task (IV-ReID), which takes infrared images as input and matches with visible color images …

2D object recognition: a comparative analysis of SIFT, SURF and ORB feature descriptors

M Bansal, M Kumar, M Kumar - Multimedia Tools and Applications, 2021 - Springer
Object recognition is a key research area in the field of image processing and computer
vision, which recognizes the object in an image and provides a proper label. In the paper …

Remote sensing image scene classification: Benchmark and state of the art

G Cheng, J Han, X Lu - Proceedings of the IEEE, 2017 - ieeexplore.ieee.org
Remote sensing image scene classification plays an important role in a wide range of
applications and hence has been receiving remarkable attention. During the past years …