Learning multi-scale features for speech emotion recognition with connection attention mechanism

Z Chen, J Li, H Liu, X Wang, H Wang… - Expert Systems with …, 2023 - Elsevier
Speech emotion recognition (SER) has become a crucial topic in the field of human–
computer interactions. Feature representation plays an important role in SER, but there are …

Learning efficient representations for keyword spotting with triplet loss

R Vygon, N Mikhaylovskiy - … 2021, St. Petersburg, Russia, September 27 …, 2021 - Springer
In the past few years, triplet loss-based metric embeddings have become a de-facto
standard for several important computer vision problems, most notably, person …

Robust human face emotion classification using triplet-loss-based deep CNN features and SVM

I Haider, HJ Yang, GS Lee, SH Kim - Sensors, 2023 - mdpi.com
Human facial emotion detection is one of the challenging tasks in computer vision. Owing to
high inter-class variance, it is hard for machine learning models to predict facial emotions …

Self-supervised endoscopic image key-points matching

M Farhat, H Chaabouni-Chouayakh… - Expert Systems with …, 2023 - Elsevier
Feature matching and finding correspondences between endoscopic images is a key step in
many clinical applications such as patient follow-up and generation of panoramic image …

Interpretable multimodal emotion recognition using hybrid fusion of speech and image data

P Kumar, S Malik, B Raman - Multimedia Tools and Applications, 2024 - Springer
This paper proposes a multimodal emotion recognition system based on hybrid fusion that
classifies the emotions depicted by speech utterances and corresponding images into …

Utterance level feature aggregation with deep metric learning for speech emotion recognition

B Mocanu, R Tapu, T Zaharia - Sensors, 2021 - mdpi.com
Emotion is a form of high-level paralinguistic information that is intrinsically conveyed by
human speech. Automatic speech emotion recognition is an essential challenge for various …

Optimal Transport with Class Structure Exploration for Cross-Domain Speech Emotion Recognition

R Zhang, J Wei, X Lu, J Xu, Y Li, D **… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Speech emotion recognition (SER) has widespread applications in human-computer
interaction. However, the performance of SER models often drops in domain mismatch …

Vector learning representation for generalized speech emotion recognition

S Singkul, K Woraratpanya - Heliyon, 2022 - cell.com
Speech emotion recognition (SER) plays an important role in global business today to
improve service efficiency. In the literature of SER, many techniques have been using deep …

[HTML][HTML] Feature-Enhanced Multi-Task Learning for Speech Emotion Recognition Using Decision Trees and LSTM

C Wang, X Shen - Electronics, 2024 - mdpi.com
Speech emotion recognition (SER) plays an important role in human-computer interaction
(HCI) technology and has a wide range of application scenarios in medical medicine …

Quantifying emotional similarity in speech

J Harvill, SG Leem, M AbdelWahab… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
This study proposes the novel formulation of measuring emotional similarity between
speech recordings. This formulation explores the ordinal nature of emotions by comparing …