Learning multi-scale features for speech emotion recognition with connection attention mechanism
Speech emotion recognition (SER) has become a crucial topic in the field of human–
computer interactions. Feature representation plays an important role in SER, but there are …
computer interactions. Feature representation plays an important role in SER, but there are …
Learning efficient representations for keyword spotting with triplet loss
R Vygon, N Mikhaylovskiy - … 2021, St. Petersburg, Russia, September 27 …, 2021 - Springer
In the past few years, triplet loss-based metric embeddings have become a de-facto
standard for several important computer vision problems, most notably, person …
standard for several important computer vision problems, most notably, person …
Robust human face emotion classification using triplet-loss-based deep CNN features and SVM
Human facial emotion detection is one of the challenging tasks in computer vision. Owing to
high inter-class variance, it is hard for machine learning models to predict facial emotions …
high inter-class variance, it is hard for machine learning models to predict facial emotions …
Self-supervised endoscopic image key-points matching
M Farhat, H Chaabouni-Chouayakh… - Expert Systems with …, 2023 - Elsevier
Feature matching and finding correspondences between endoscopic images is a key step in
many clinical applications such as patient follow-up and generation of panoramic image …
many clinical applications such as patient follow-up and generation of panoramic image …
Interpretable multimodal emotion recognition using hybrid fusion of speech and image data
This paper proposes a multimodal emotion recognition system based on hybrid fusion that
classifies the emotions depicted by speech utterances and corresponding images into …
classifies the emotions depicted by speech utterances and corresponding images into …
Utterance level feature aggregation with deep metric learning for speech emotion recognition
Emotion is a form of high-level paralinguistic information that is intrinsically conveyed by
human speech. Automatic speech emotion recognition is an essential challenge for various …
human speech. Automatic speech emotion recognition is an essential challenge for various …
Optimal Transport with Class Structure Exploration for Cross-Domain Speech Emotion Recognition
Speech emotion recognition (SER) has widespread applications in human-computer
interaction. However, the performance of SER models often drops in domain mismatch …
interaction. However, the performance of SER models often drops in domain mismatch …
Vector learning representation for generalized speech emotion recognition
Speech emotion recognition (SER) plays an important role in global business today to
improve service efficiency. In the literature of SER, many techniques have been using deep …
improve service efficiency. In the literature of SER, many techniques have been using deep …
[HTML][HTML] Feature-Enhanced Multi-Task Learning for Speech Emotion Recognition Using Decision Trees and LSTM
C Wang, X Shen - Electronics, 2024 - mdpi.com
Speech emotion recognition (SER) plays an important role in human-computer interaction
(HCI) technology and has a wide range of application scenarios in medical medicine …
(HCI) technology and has a wide range of application scenarios in medical medicine …
Quantifying emotional similarity in speech
This study proposes the novel formulation of measuring emotional similarity between
speech recordings. This formulation explores the ordinal nature of emotions by comparing …
speech recordings. This formulation explores the ordinal nature of emotions by comparing …