A survey of the vision transformers and their CNN-transformer based variants

A Khan, Z Rauf, A Sohail, AR Khan, H Asif… - Artificial Intelligence …, 2023 - Springer
Vision transformers have become popular as a possible substitute to convolutional neural
networks (CNNs) for a variety of computer vision applications. These transformers, with their …

Deconv-transformer (DecT): A histopathological image classification model for breast cancer based on color deconvolution and transformer architecture

Z He, M Lin, Z Xu, Z Yao, H Chen, A Alhudhaif… - Information …, 2022 - Elsevier
Histopathological image recognition of breast cancer is an onerous task. Although many
deep learning models have achieved good classification results on histopathological image …

Few-shot learning for facial expression recognition: a comprehensive survey

CL Kim, BG Kim - Journal of Real-Time Image Processing, 2023 - Springer
Facial expression recognition (FER) is utilized in various fields that analyze facial
expressions. FER is attracting increasing attention for its role in improving the convenience …

Transformer-based multimodal feature enhancement networks for multimodal depression detection integrating video, audio and remote photoplethysmograph signals

H Fan, X Zhang, Y Xu, J Fang, S Zhang, X Zhao, J Yu - Information Fusion, 2024 - Elsevier
Depression stands as one of the most widespread psychological disorders and has
garnered increasing attention. Currently, how to effectively achieve automatic multimodal …

[HTML][HTML] A hybrid model of ghost-convolution enlightened transformer for effective diagnosis of grape leaf disease and pest

X Lu, R Yang, J Zhou, J Jiao, F Liu, Y Liu, B Su… - Journal of King Saud …, 2022 - Elsevier
Disease and pest are the main factors causing grape yield reduction. Correct and timely
identification of these symptoms are necessary for the vineyard. However, the commonly …

A deeper look into aleatoric and epistemic uncertainty disentanglement

M Valdenegro-Toro, DS Mori - 2022 IEEE/CVF Conference on …, 2022 - ieeexplore.ieee.org
Neural networks are ubiquitous in many tasks, but trusting their predictions is an open issue.
Uncertainty quantification is required for many applications, and disentangled aleatoric and …

An ensemble of a boosted hybrid of deep learning models and technical analysis for forecasting stock prices

AF Kamara, E Chen, Z Pan - Information Sciences, 2022 - Elsevier
For several years the modeling as well as forecasting of the prices of stocks have been
extremely challenging for the business community and researchers as a result of the …

Self-supervised vision transformer-based few-shot learning for facial expression recognition

X Chen, X Zheng, K Sun, W Liu, Y Zhang - Information Sciences, 2023 - Elsevier
Facial expression recognition (FER) is embedded in many real-world human-computer
interaction tasks, such as online learning, depression recognition and remote diagnosis …

Hierarchical attention network with progressive feature fusion for facial expression recognition

H Tao, Q Duan - Neural Networks, 2024 - Elsevier
Facial expression recognition (FER) in the wild is challenging due to the disturbing factors
including pose variation, occlusions, and illumination variation. The attention mechanism …

MMATrans: Muscle movement aware representation learning for facial expression recognition via transformers

H Liu, Q Zhou, C Zhang, J Zhu, T Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
How to automatically recognize facial expression has caused concerns in industrial human–
robot interaction. However, facial expression recognition (FER) is susceptible to problems …