Fine-grained image analysis with deep learning: A survey

XS Wei, YZ Song, O Mac Aodha, J Wu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer
vision and pattern recognition, and underpins a diverse set of real-world applications. The …

Transfg: A transformer architecture for fine-grained recognition

J He, JN Chen, S Liu, A Kortylewski, C Yang… - Proceedings of the …, 2022 - ojs.aaai.org
Fine-grained visual classification (FGVC) which aims at recognizing objects from
subcategories is a very challenging task due to the inherently subtle inter-class differences …

Neural prototype trees for interpretable fine-grained image recognition

M Nauta, R Van Bree, C Seifert - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Prototype-based methods use interpretable representations to address the black-box nature
of deep learning models, in contrast to post-hoc explanation methods that only approximate …

Class attention network for image recognition

G Cheng, P Lai, D Gao, J Han - Science China Information Sciences, 2023 - Springer
Visual attention has become a popular and widely used component for image recognition.
Although various attention-based methods have been proposed and achieved relatively …

Feature fusion vision transformer for fine-grained visual categorization

J Wang, X Yu, Y Gao - arxiv preprint arxiv:2107.02341, 2021 - arxiv.org
The core for tackling the fine-grained visual categorization (FGVC) is to learn subtle yet
discriminative features. Most previous works achieve this by explicitly selecting the …

Vit-net: Interpretable vision transformers with neural tree decoder

S Kim, J Nam, BC Ko - International conference on machine …, 2022 - proceedings.mlr.press
Vision transformers (ViTs), which have demonstrated a state-of-the-art performance in image
classification, can also visualize global interpretations through attention-based contributions …

SwinFG: A fine-grained recognition scheme based on swin transformer

Z Ma, X Wu, A Chu, L Huang, Z Wei - Expert Systems with Applications, 2024 - Elsevier
Fine-grained image recognition (FGIR) is a challenging task as it requires the recognition of
sub-categories with subtle differences. Recently, the swin transformer has shown impressive …

A survey of neural trees: Co-evolving neural networks and decision trees

H Li, J Song, M Xue, H Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Neural networks (NNs) and decision trees (DTs) are both popular models of machine
learning, yet coming with mutually exclusive advantages and limitations. To bring the best of …

A spatial feature-enhanced attention neural network with high-order pooling representation for application in pest and disease recognition

J Kong, H Wang, C Yang, X **, M Zuo, X Zhang - Agriculture, 2022 - mdpi.com
With the development of advanced information and intelligence technologies, precision
agriculture has become an effective solution to monitor and prevent crop pests and …

Rams-trans: Recurrent attention multi-scale transformer for fine-grained image recognition

Y Hu, X **, Y Zhang, H Hong, J Zhang, Y He… - Proceedings of the 29th …, 2021 - dl.acm.org
In fine-grained image recognition (FGIR), the localization and amplification of region
attention is an important factor, which has been explored extensively convolutional neural …