Transformer-based multimodal feature enhancement networks for multimodal depression detection integrating video, audio and remote photoplethysmograph signals

H Fan, X Zhang, Y Xu, J Fang, S Zhang, X Zhao, J Yu - Information Fusion, 2024 - Elsevier
Depression stands as one of the most widespread psychological disorders and has
garnered increasing attention. Currently, how to effectively achieve automatic multimodal …

Context-aware emotion recognition networks

J Lee, S Kim, S Kim, J Park… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Traditional techniques for emotion recognition have focused on the facial expression
analysis only, thus providing limited ability to encode context that comprehensively …

Facial emotion recognition based real-time learner engagement detection system in online learning context using deep learning models

S Gupta, P Kumar, RK Tekchandani - Multimedia Tools and Applications, 2023 - Springer
The dramatic impact of the COVID-19 pandemic has resulted in the closure of physical
classrooms and teaching methods being shifted to the online medium. To make the online …

Multimodal spatiotemporal representation for automatic depression level detection

M Niu, J Tao, B Liu, J Huang… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Physiological studies have shown that there are some differences in speech and facial
activities between depressive and healthy individuals. Based on this fact, we propose a …

[HTML][HTML] Deep-learning-based multimodal emotion classification for music videos

YR Pandeya, B Bhattarai, J Lee - Sensors, 2021 - mdpi.com
Music videos contain a great deal of visual and acoustic information. Each information
source within a music video influences the emotions conveyed through the audio and video …

Multi-modal recurrent attention networks for facial expression recognition

J Lee, S Kim, S Kim, K Sohn - IEEE Transactions on Image …, 2020 - ieeexplore.ieee.org
Recent deep neural networks based methods have achieved state-of-the-art performance
on various facial expression recognition tasks. Despite such progress, previous researches …

Stimuli-aware visual emotion analysis

J Yang, J Li, X Wang, Y Ding… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Visual emotion analysis (VEA) has attracted great attention recently, due to the increasing
tendency of expressing and understanding emotions through images on social networks …

A novel multimodal depression diagnosis approach utilizing a new hybrid fusion method

X Zhang, B Li, G Qi - Biomedical Signal Processing and Control, 2024 - Elsevier
In recent years, research has found that the impact of depression status primarily lies in
patients' language expression and facial expressions. Furthermore, facial expressions and …

A two-stage spatiotemporal attention convolution network for continuous dimensional emotion recognition from facial video

M Hu, Q Chu, X Wang, L He… - IEEE Signal Processing …, 2021 - ieeexplore.ieee.org
Continuous dimensional emotion recognition for facial video sequence is a crucial and
challenging task in Affective Computing and Human-Computer Intelligent Interaction. The …

Spatio-temporal encoder-decoder fully convolutional network for video-based dimensional emotion recognition

Z Du, S Wu, D Huang, W Li… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Video-based dimensional emotion recognition aims to map human affect into the
dimensional emotion space based on visual signals, which is a fundamental challenge in …