Dawn of the transformer era in speech emotion recognition: closing the valence gap

J Wagner, A Triantafyllopoulos… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Recent advances in transformer-based architectures have shown promise in several
machine learning tasks. In the audio domain, such architectures have been successfully …

A review on speech emotion recognition: a survey, recent advances, challenges, and the influence of noise

SM George, PM Ilyas - Neurocomputing, 2024 - Elsevier
Affective Computing systems can detect the emotional state and mindset of an individual.
Speech Emotion Recognition (SER) is a unimodal affect computing system based on …

Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP

N Antoniou, A Katsamanis… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
There is an imminent need for guidelines and standard test sets to allow direct and fair
comparisons of speech emotion recognition (SER). While resources, such as the Interactive …

[PDF][PDF] Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios.

R Pappagari, J Cho, S Joshi, L Moro-Velázquez… - Interspeech, 2021 - researchgate.net
In this study, we analyze the use of speech and speaker recognition technologies and
natural language processing to detect Alzheimer disease (AD) and estimate mini-mental …

EdgeYOLO: an edge-real-time object detector

S Liu, J Zha, J Sun, Z Li, G Wang - 2023 42nd Chinese Control …, 2023 - ieeexplore.ieee.org
An efficient, low-complexity, and anchor-free object detector based on the state-of-the-art
YOLO framework is proposed in this paper, which can be implemented in real time on edge …

Privacy-preserving speech emotion recognition through semi-supervised federated learning

V Tsouvalas, T Ozcelebi… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Speech Emotion Recognition (SER) refers to the recognition of human emotions from
natural speech. If done accurately, it can offer a number of benefits in building human …

Non-contrastive self-supervised learning for utterance-level information extraction from speech

J Cho, J Villalba, L Moro-Velazquez… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
In recent studies, self-supervised pre-trained models tend to outperform supervised pre-
trained models in transfer learning. In particular, self-supervised learning of utterance-level …

ATDA: Attentional temporal dynamic activation for speech emotion recognition

LY Liu, WZ Liu, J Zhou, HY Deng, L Feng - Knowledge-Based Systems, 2022 - Elsevier
Speech emotion recognition (SER) plays a vital role in intelligent human–computer
interaction (HCI). The Convolutional Neural Network (CNN) is widely used in SER …

Optimal Transport with Class Structure Exploration for Cross-Domain Speech Emotion Recognition

R Zhang, J Wei, X Lu, J Xu, Y Li, D **… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Speech emotion recognition (SER) has widespread applications in human-computer
interaction. However, the performance of SER models often drops in domain mismatch …

[HTML][HTML] A Combined CNN Architecture for Speech Emotion Recognition

R Begazo, A Aguilera, I Dongo, Y Cardinale - Sensors, 2024 - mdpi.com
Emotion recognition through speech is a technique employed in various scenarios of
Human–Computer Interaction (HCI). Existing approaches have achieved significant results; …