Speech emotion recognition: a comprehensive survey

MJ Al-Dujaili, A Ebrahimi-Moghadam - Wireless Personal Communications, 2023 - Springer
Speech emotion recognition could be considered a new topic in speech processing that
plays an essential role in human interaction. Emotions are a kind of speech …

LEAF: A learnable frontend for audio classification

N Zeghidour, O Teboul, FDC Quitry… - arXiv preprint arXiv …, 2021 - arxiv.org
Mel-filterbanks are fixed, engineered audio features which emulate human perception and
have been used through the history of audio understanding up to today. However, their …

Automatic speech emotion recognition using modulation spectral features

S Wu, TH Falk, WY Chan - Speech Communication, 2011 - Elsevier
In this study, modulation spectral features (MSFs) are proposed for the automatic recognition
of human affective information from speech. The features are extracted from an auditory …

A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech

TH Falk, C Zheng, WY Chan - IEEE Transactions on Audio …, 2010 - ieeexplore.ieee.org
A modulation spectral representation is investigated for non-intrusive quality and
intelligibility measurement of reverberant and dereverberated speech. The representation is …

Emotion recognition using hybrid Gaussian mixture model and deep neural network

I Shahin, AB Nassif, S Hamsa - IEEE Access, 2019 - ieeexplore.ieee.org
This paper aims at recognizing emotions for a text-independent and speaker-independent
emotion recognition system based on a novel classifier, which is a hybrid of a cascaded …

Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition

T Yoshioka, A Sehr, M Delcroix… - IEEE Signal …, 2012 - ieeexplore.ieee.org
Speech recognition technology has left the research laboratory and is increasingly coming
into practical use, enabling a wide spectrum of innovative and exciting voice-driven …

Voices obscured in complex environmental settings (VOICES) corpus

C Richey, MA Barrios, Z Armstrong, C Bartels… - arXiv preprint arXiv …, 2018 - arxiv.org
This paper introduces the Voices Obscured In Complex Environmental Settings (VOICES)
corpus, a freely available dataset under Creative Commons BY 4.0. This dataset will …

Robust speaker identification in noisy and reverberant conditions

X Zhao, Y Wang, DL Wang - IEEE/ACM Transactions on Audio …, 2014 - ieeexplore.ieee.org
Robustness of speaker recognition systems is crucial for real-world applications, which
typically contain both additive noise and room reverberation. However, the combined effects …

HI-MIA: A far-field text-dependent speaker verification database and the baselines

X Qin, H Bu, M Li - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
This paper presents a far-field text-dependent speaker verification database named HI-MIA.
We aim to meet the data requirement for far-field microphone array based speaker …