Unlocking the emotional world of visual media: An overview of the science, research, and impact of understanding emotion
The emergence of artificial emotional intelligence technology is revolutionizing the fields of
computers and robotics, allowing for a new level of communication and understanding of …
computers and robotics, allowing for a new level of communication and understanding of …
Bridging computer and education sciences: A systematic review of automated emotion recognition in online learning environments
S Yu, A Androsov, H Yan, Y Chen - Computers & Education, 2024 - Elsevier
Emotions play an important role in the learning process. With intelligent technology support,
identification and intervention of learners' cognition have made great achievement, but the …
identification and intervention of learners' cognition have made great achievement, but the …
Dip: Dual incongruity perceiving network for sarcasm detection
Sarcasm indicates the literal meaning is contrary to the real attitude. Considering the
popularity and complementarity of image-text data, we investigate the task of multi-modal …
popularity and complementarity of image-text data, we investigate the task of multi-modal …
High-fidelity generalized emotional talking face generation with multi-modal emotion space learning
Recently, emotional talking face generation has received considerable attention. However,
existing methods only adopt one-hot coding, image, or audio as emotion conditions, thus …
existing methods only adopt one-hot coding, image, or audio as emotion conditions, thus …
Mart: Masked affective representation learning via masked temporal distribution distillation
Limited training data is a long-standing problem for video emotion analysis (VEA). Existing
works leverage the power of large-scale image datasets for transferring while failing to …
works leverage the power of large-scale image datasets for transferring while failing to …
Multimodal large language models: A survey
The exploration of multimodal language models integrates multiple data types, such as
images, text, language, audio, and other heterogeneity. While the latest large language …
images, text, language, audio, and other heterogeneity. While the latest large language …
Extdm: Distribution extrapolation diffusion model for video prediction
Video prediction is a challenging task due to its nature of uncertainty especially for
forecasting a long period. To model the temporal dynamics advanced methods benefit from …
forecasting a long period. To model the temporal dynamics advanced methods benefit from …
Progressive neighbor consistency mining for correspondence pruning
The goal of correspondence pruning is to recognize correct correspondences (inliers) from
initial ones, with applications to various feature matching based tasks. Seeking neighbors in …
initial ones, with applications to various feature matching based tasks. Seeking neighbors in …
Multi-modal emotion recognition using EEG and speech signals
Abstract Automatic Emotion Recognition (AER) is critical for naturalistic Human–Machine
Interactions (HMI). Emotions can be detected through both external behaviors, eg, tone of …
Interactions (HMI). Emotions can be detected through both external behaviors, eg, tone of …
Weakly supervised video emotion detection and prediction via cross-modal temporal erasing network
Automatically predicting the emotions of user-generated videos (UGVs) receives increasing
interest recently. However, existing methods mainly focus on a few key visual frames, which …
interest recently. However, existing methods mainly focus on a few key visual frames, which …