Speech technology for healthcare: Opportunities, challenges, and state of the art
Speech technology is not appropriately explored even though modern advances in speech
technology—especially those driven by deep learning (DL) technology—offer …
technology—especially those driven by deep learning (DL) technology—offer …
A review of multimodal emotion recognition from datasets, preprocessing, features, and fusion methods
B Pan, K Hirota, Z Jia, Y Dai - Neurocomputing, 2023 - Elsevier
Affective computing is one of the most important research fields in modern human–computer
interaction (HCI). The goal of affective computing is to study and develop the theories …
interaction (HCI). The goal of affective computing is to study and develop the theories …
Survey of deep representation learning for speech emotion recognition
Traditionally, speech emotion recognition (SER) research has relied on manually
handcrafted acoustic features using feature engineering. However, the design of …
handcrafted acoustic features using feature engineering. However, the design of …
End to end multi-task learning with attention for multi-objective fault diagnosis under small sample
In recent years, deep learning (DL) based intelligent fault diagnosis method has been widely
applied in the field of equipment fault diagnosis. However, most of the existing methods are …
applied in the field of equipment fault diagnosis. However, most of the existing methods are …
A survey on deep reinforcement learning for audio-based applications
Deep reinforcement learning (DRL) is poised to revolutionise the field of artificial intelligence
(AI) by endowing autonomous systems with high levels of understanding of the real world …
(AI) by endowing autonomous systems with high levels of understanding of the real world …
Self supervised adversarial domain adaptation for cross-corpus and cross-language speech emotion recognition
Despite the recent advancement in speech emotion recognition (SER) within a single corpus
setting, the performance of these SER systems degrades significantly for cross-corpus and …
setting, the performance of these SER systems degrades significantly for cross-corpus and …
Deep representation learning in speech processing: Challenges, recent advances, and future trends
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …
engineered acoustic features (feature engineering) as a separate distinct problem from the …
Learning multiple dense prediction tasks from partially annotated data
Despite the recent advances in multi-task learning of dense prediction problems, most
methods rely on expensive labelled datasets. In this paper, we present a label efficient …
methods rely on expensive labelled datasets. In this paper, we present a label efficient …
A deep interpretable representation learning method for speech emotion recognition
This paper focuses on the active interpretability for deep learning-based speech emotion
recognition (SER). To achieve this, we propose an explicit feature constrained model, the …
recognition (SER). To achieve this, we propose an explicit feature constrained model, the …
A comprehensive survey on multi-modal conversational emotion recognition with deep learning
Multi-modal conversation emotion recognition (MCER) aims to recognize and track the
speaker's emotional state using text, speech, and visual information in the conversation …
speaker's emotional state using text, speech, and visual information in the conversation …