Code-switching in automatic speech recognition: The issues and future directions

MB Mustafa, MA Yusoof, HK Khalaf… - Applied Sciences, 2022 - mdpi.com
Code-switching (CS) in spoken language is where the speech has two or more languages
within an utterance. It is an unsolved issue in automatic speech recognition (ASR) research …

[HTML][HTML] Map** the bibliometrics landscape of AI in medicine: methodological study

J Shi, D Bendig, HC Vollmar, P Rasche - Journal of Medical Internet …, 2023 - jmir.org
Background Artificial intelligence (AI), conceived in the 1950s, has permeated numerous
industries, intensifying in tandem with advancements in computing power. Despite the …

New approach in quantification of emotional intensity from the speech signal: emotional temperature

JB Alonso, J Cabrera, M Medina… - Expert Systems with …, 2015 - Elsevier
The automatic speech emotion recognition has a huge potential in applications of fields
such as psychology, psychiatry and the affective computing technology. The spontaneous …

Continuous tracking of the emotion temperature

JB Alonso, J Cabrera, CM Travieso, K López-de-Ipiña… - Neurocomputing, 2017 - Elsevier
The speech emotion recognition has a huge potential in human computer interaction
applications in fields such as psychology, psychiatry and affective computing technology …

A Context-Based Numerical Format Prediction for a Text-To-Speech System

Y Darwesh, LW Wern, MB Mustafa - ar** an HMM-based speech synthesis system for Malay: a comparison of iterative and isolated unit training
MB Mustafa, ZM Don, RN Ainon… - … on Information and …, 2014 - search.ieice.org
The development of an HMM-based speech synthesis system for a new language requires
resources like speech database and segment-phonetic labels. As an under-resourced …

ET-GAN: cross-language emotion transfer based on cycle-consistent generative adversarial networks

X Jia, J Tai, H Zhou, Y Li, W Zhang, H Du, Q Huang - ECAI 2020, 2020 - ebooks.iospress.nl
Despite the remarkable progress made in synthesizing emotional speech from text, it is still
challenging to provide emotion information to existing speech segments. Previous methods …

An analysis of Malay language emotional speech corpus for emotion recognition system

N Apandi, N Jamil - 2016 IEEE Industrial Electronics and …, 2016 - ieeexplore.ieee.org
Human speech is known as the information carrier because its signals can convey people's
emotions, age, gender, and ethnic. Speech and emotions are interrelated where from …

A Kullback-Leibler divergence based recurrent mixture density network for acoustic modeling in emotional statistical parametric speech synthesis

X An, Y Zhang, B Liu, L Xue, L **e - … of the Joint Workshop of the 4th …, 2018 - dl.acm.org
This paper proposes a Kullback-Leibler divergence (KLD) based recurrent mixture density
network (RMDN) approach for acoustic modeling in emotional statistical parametric speech …

Code-Switching in Automatic Speech Recognition: The Issues and Future Directions

M Begum Mustafa, MA Yusoof, HK Al-Ani… - 2022 - figshare.cardiffmet.ac.uk
Code-switching (CS) in spoken language is where the speech has two or more languages
within an utterance. It is an unsolved issue in automatic speech recognition (ASR) research …