Dawn of the transformer era in speech emotion recognition: closing the valence gap
J Wagner, A Triantafyllopoulos… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Recent advances in transformer-based architectures have shown promise in several
machine learning tasks. In the audio domain, such architectures have been successfully …
machine learning tasks. In the audio domain, such architectures have been successfully …
A review on speech emotion recognition: a survey, recent advances, challenges, and the influence of noise
SM George, PM Ilyas - Neurocomputing, 2024 - Elsevier
Affective Computing systems can detect the emotional state and mindset of an individual.
Speech Emotion Recognition (SER) is a unimodal affect computing system based on …
Speech Emotion Recognition (SER) is a unimodal affect computing system based on …
Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP
N Antoniou, A Katsamanis… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
There is an imminent need for guidelines and standard test sets to allow direct and fair
comparisons of speech emotion recognition (SER). While resources, such as the Interactive …
comparisons of speech emotion recognition (SER). While resources, such as the Interactive …
[PDF][PDF] Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios.
In this study, we analyze the use of speech and speaker recognition technologies and
natural language processing to detect Alzheimer disease (AD) and estimate mini-mental …
natural language processing to detect Alzheimer disease (AD) and estimate mini-mental …
EdgeYOLO: an edge-real-time object detector
An efficient, low-complexity, and anchor-free object detector based on the state-of-the-art
YOLO framework is proposed in this paper, which can be implemented in real time on edge …
YOLO framework is proposed in this paper, which can be implemented in real time on edge …
Privacy-preserving speech emotion recognition through semi-supervised federated learning
V Tsouvalas, T Ozcelebi… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Speech Emotion Recognition (SER) refers to the recognition of human emotions from
natural speech. If done accurately, it can offer a number of benefits in building human …
natural speech. If done accurately, it can offer a number of benefits in building human …
Non-contrastive self-supervised learning for utterance-level information extraction from speech
In recent studies, self-supervised pre-trained models tend to outperform supervised pre-
trained models in transfer learning. In particular, self-supervised learning of utterance-level …
trained models in transfer learning. In particular, self-supervised learning of utterance-level …
ATDA: Attentional temporal dynamic activation for speech emotion recognition
LY Liu, WZ Liu, J Zhou, HY Deng, L Feng - Knowledge-Based Systems, 2022 - Elsevier
Speech emotion recognition (SER) plays a vital role in intelligent human–computer
interaction (HCI). The Convolutional Neural Network (CNN) is widely used in SER …
interaction (HCI). The Convolutional Neural Network (CNN) is widely used in SER …
Optimal Transport with Class Structure Exploration for Cross-Domain Speech Emotion Recognition
Speech emotion recognition (SER) has widespread applications in human-computer
interaction. However, the performance of SER models often drops in domain mismatch …
interaction. However, the performance of SER models often drops in domain mismatch …
[HTML][HTML] A Combined CNN Architecture for Speech Emotion Recognition
Emotion recognition through speech is a technique employed in various scenarios of
Human–Computer Interaction (HCI). Existing approaches have achieved significant results; …
Human–Computer Interaction (HCI). Existing approaches have achieved significant results; …