A deep learning approaches in text-to-speech system: a systematic review and recent research perspective

Y Kumar, A Koul, C Singh - Multimedia Tools and Applications, 2023 - Springer
Text-to-speech systems (TTS) have come a long way in the last decade and are now a
popular research topic for creating various human-computer interaction systems. Although, a …

Controllable data generation by deep learning: A review

S Wang, Y Du, X Guo, B Pan, Z Qin, L Zhao - ACM Computing Surveys, 2024 - dl.acm.org
Designing and generating new data under targeted properties has been attracting various
critical applications such as molecule design, image editing and speech synthesis …

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org
Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Speech synthesis with mixed emotions

K Zhou, B Sisman, R Rana… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Emotional speech synthesis aims to synthesize human voices with various emotional effects.
The current studies are mostly focused on imitating an averaged style belonging to a specific …

Fatigue detection of air traffic controllers based on radiotelephony communications and self-adaption quantum genetic algorithm optimization ensemble …

N Wu, J Sun - Applied Sciences, 2022 - mdpi.com
Air traffic controller (ATC) fatigue has become a major cause of air traffic accidents.
Speech-based fatigue-state detection is proposed in this paper. The speech signal is preprocessed …

Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources

H Barakat, O Turk, C Demiroglu - EURASIP Journal on Audio, Speech, and …, 2024 - Springer
Speech synthesis has made significant strides thanks to the transition from machine learning
to deep learning models. Contemporary text-to-speech (TTS) models possess the capability …

Speech enhancement from fused features based on deep neural network and gated recurrent unit network

Y Wang, J Han, T Zhang, D Qing - EURASIP Journal on Advances in …, 2021 - Springer
Speech is easily interfered by external environment in reality, which results in the loss of
important features. Deep learning has become a popular speech enhancement method …

Engineering cost prediction model based on DNN

B Li, Q Xin, L Zhang - Scientific Programming, 2022 - Wiley Online Library
A DNN‐based cost prediction method is proposed for the difficult problem of cost calculation
in engineering cost accounting, combined with deep neural networks. Firstly, we introduce …

Emotion modelling for speech generation

K Zhou - 2023 - search.proquest.com
Speech generation aims to synthesize human-like voices from the input of text or speech.
Current speech generation techniques can generate high quality, natural-sounding speech …

FastSpeech2 Based Japanese Emotional Speech Synthesis

M Ikeda, K Markov - 2024 IEEE 12th International Conference …, 2024 - ieeexplore.ieee.org
Modeling emotions is an important part of text-to-speech (TTS) research since its goal is to
develop a technology for synthesizing naturally sounding speech. In this study, we aimed to …