Speech technology for healthcare: Opportunities, challenges, and state of the art

S Latif, J Qadir, A Qayyum, M Usama… - IEEE Reviews in …, 2020 - ieeexplore.ieee.org
Speech technology is not appropriately explored even though modern advances in speech
technology—especially those driven by deep learning (DL) technology—offer …

A review of multimodal emotion recognition from datasets, preprocessing, features, and fusion methods

B Pan, K Hirota, Z Jia, Y Dai - Neurocomputing, 2023 - Elsevier
Affective computing is one of the most important research fields in modern human–computer
interaction (HCI). The goal of affective computing is to study and develop the theories …

Survey of deep representation learning for speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Traditionally, speech emotion recognition (SER) research has relied on manually
handcrafted acoustic features using feature engineering. However, the design of …

End to end multi-task learning with attention for multi-objective fault diagnosis under small sample

Z **e, J Chen, Y Feng, K Zhang, Z Zhou - Journal of Manufacturing Systems, 2022 - Elsevier
In recent years, deep learning (DL) based intelligent fault diagnosis method has been widely
applied in the field of equipment fault diagnosis. However, most of the existing methods are …

A survey on deep reinforcement learning for audio-based applications

S Latif, H Cuayáhuitl, F Pervez, F Shamshad… - Artificial Intelligence …, 2023 - Springer
Deep reinforcement learning (DRL) is poised to revolutionise the field of artificial intelligence
(AI) by endowing autonomous systems with high levels of understanding of the real world …

Self supervised adversarial domain adaptation for cross-corpus and cross-language speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Despite the recent advancement in speech emotion recognition (SER) within a single corpus
setting, the performance of these SER systems degrades significantly for cross-corpus and …

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arxiv preprint arxiv …, 2020 - arxiv.org
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

Learning multiple dense prediction tasks from partially annotated data

WH Li, X Liu, H Bilen - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Despite the recent advances in multi-task learning of dense prediction problems, most
methods rely on expensive labelled datasets. In this paper, we present a label efficient …

A deep interpretable representation learning method for speech emotion recognition

E **g, Y Liu, Y Chai, J Sun, S Samtani, Y Jiang… - Information Processing …, 2023 - Elsevier
This paper focuses on the active interpretability for deep learning-based speech emotion
recognition (SER). To achieve this, we propose an explicit feature constrained model, the …

A comprehensive survey on multi-modal conversational emotion recognition with deep learning

Y Shou, T Meng, W Ai, N Yin, K Li - arxiv preprint arxiv:2312.05735, 2023 - arxiv.org
Multi-modal conversation emotion recognition (MCER) aims to recognize and track the
speaker's emotional state using text, speech, and visual information in the conversation …