Unlocking the emotional world of visual media: An overview of the science, research, and impact of understanding emotion

JZ Wang, S Zhao, C Wu, RB Adams… - Proceedings of the …, 2023‏ - ieeexplore.ieee.org
The emergence of artificial emotional intelligence technology is revolutionizing the fields of
computers and robotics, allowing for a new level of communication and understanding of …

Bridging computer and education sciences: A systematic review of automated emotion recognition in online learning environments

S Yu, A Androsov, H Yan, Y Chen - Computers & Education, 2024‏ - Elsevier
Emotions play an important role in the learning process. With intelligent technology support,
identification and intervention of learners' cognition have made great achievement, but the …

Dip: Dual incongruity perceiving network for sarcasm detection

C Wen, G Jia, J Yang - … of the IEEE/CVF Conference on …, 2023‏ - openaccess.thecvf.com
Sarcasm indicates the literal meaning is contrary to the real attitude. Considering the
popularity and complementarity of image-text data, we investigate the task of multi-modal …

High-fidelity generalized emotional talking face generation with multi-modal emotion space learning

C Xu, J Zhu, J Zhang, Y Han, W Chu… - Proceedings of the …, 2023‏ - openaccess.thecvf.com
Recently, emotional talking face generation has received considerable attention. However,
existing methods only adopt one-hot coding, image, or audio as emotion conditions, thus …

Mart: Masked affective representation learning via masked temporal distribution distillation

Z Zhang, P Zhao, E Park… - Proceedings of the IEEE …, 2024‏ - openaccess.thecvf.com
Limited training data is a long-standing problem for video emotion analysis (VEA). Existing
works leverage the power of large-scale image datasets for transferring while failing to …

Multimodal large language models: A survey

J Wu, W Gan, Z Chen, S Wan… - 2023 IEEE International …, 2023‏ - ieeexplore.ieee.org
The exploration of multimodal language models integrates multiple data types, such as
images, text, language, audio, and other heterogeneity. While the latest large language …

Extdm: Distribution extrapolation diffusion model for video prediction

Z Zhang, J Hu, W Cheng, D Paudel… - Proceedings of the …, 2024‏ - openaccess.thecvf.com
Video prediction is a challenging task due to its nature of uncertainty especially for
forecasting a long period. To model the temporal dynamics advanced methods benefit from …

Progressive neighbor consistency mining for correspondence pruning

X Liu, J Yang - Proceedings of the IEEE/CVF Conference …, 2023‏ - openaccess.thecvf.com
The goal of correspondence pruning is to recognize correct correspondences (inliers) from
initial ones, with applications to various feature matching based tasks. Seeking neighbors in …

Multi-modal emotion recognition using EEG and speech signals

Q Wang, M Wang, Y Yang, X Zhang - Computers in Biology and Medicine, 2022‏ - Elsevier
Abstract Automatic Emotion Recognition (AER) is critical for naturalistic Human–Machine
Interactions (HMI). Emotions can be detected through both external behaviors, eg, tone of …

Weakly supervised video emotion detection and prediction via cross-modal temporal erasing network

Z Zhang, L Wang, J Yang - … of the IEEE/CVF Conference on …, 2023‏ - openaccess.thecvf.com
Automatically predicting the emotions of user-generated videos (UGVs) receives increasing
interest recently. However, existing methods mainly focus on a few key visual frames, which …