Multi-attentional deepfake detection

H Zhao, W Zhou, D Chen, T Wei… - Proceedings of the …, 2021 - openaccess.thecvf.com
Face forgery by deepfake is widely spread over the internet and has raised severe societal
concerns. Recently, how to detect such forgery contents has become a hot research topic …

Fine-grained image analysis with deep learning: A survey

XS Wei, YZ Song, O Mac Aodha, J Wu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer
vision and pattern recognition, and underpins a diverse set of real-world applications. The …

Dual cross-attention learning for fine-grained visual categorization and object re-identification

H Zhu, W Ke, D Li, J Liu, L Tian… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recently, self-attention mechanisms have shown impressive performance in various NLP
and CV tasks, which can help capture sequential characteristics and derive global …

TransFG: A transformer architecture for fine-grained recognition

J He, JN Chen, S Liu, A Kortylewski, C Yang… - Proceedings of the …, 2022 - ojs.aaai.org
Fine-grained visual classification (FGVC) which aims at recognizing objects from
subcategories is a very challenging task due to the inherently subtle inter-class differences …

SwinFG: A fine-grained recognition scheme based on swin transformer

Z Ma, X Wu, A Chu, L Huang, Z Wei - Expert Systems with Applications, 2024 - Elsevier
Fine-grained image recognition (FGIR) is a challenging task as it requires the recognition of
sub-categories with subtle differences. Recently, the swin transformer has shown impressive …

TransIFC: Invariant cues-aware feature concentration learning for efficient fine-grained bird image classification

H Liu, C Zhang, Y Deng, B Xie, T Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Fine-grained bird image classification (FBIC) is not only meaningful for endangered bird
observation and protection but also a prevalent task for image classification in multimedia …

Feature fusion vision transformer for fine-grained visual categorization

J Wang, X Yu, Y Gao - arXiv preprint arXiv:2107.02341, 2021 - arxiv.org
The core for tackling the fine-grained visual categorization (FGVC) is to learn subtle yet
discriminative features. Most previous works achieve this by explicitly selecting the …

A spatial feature-enhanced attention neural network with high-order pooling representation for application in pest and disease recognition

J Kong, H Wang, C Yang, X Jin, M Zuo, X Zhang - Agriculture, 2022 - mdpi.com
With the development of advanced information and intelligence technologies, precision
agriculture has become an effective solution to monitor and prevent crop pests and …

ViT-NET: Interpretable vision transformers with neural tree decoder

S Kim, J Nam, BC Ko - International conference on machine …, 2022 - proceedings.mlr.press
Vision transformers (ViTs), which have demonstrated a state-of-the-art performance in image
classification, can also visualize global interpretations through attention-based contributions …

Large scale visual food recognition

W Min, Z Wang, Y Liu, M Luo, L Kang… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Food recognition plays an important role in food choice and intake, which is essential to the
health and well-being of humans. It is thus of importance to the computer vision community …