A deep learning approaches in text-to-speech system: a systematic review and recent research perspective

Y Kumar, A Koul, C Singh - Multimedia Tools and Applications, 2023 - Springer
Text-to-speech systems (TTS) have come a long way in the last decade and are now a
popular research topic for creating various human-computer interaction systems. Although, a …

On the generalizability of two-dimensional convolutional neural networks for fake speech detection

C Papastergiopoulos, A Vafeiadis… - Proceedings of the 1st …, 2022 - dl.acm.org
The powerful capabilities of modern text-to-speech methods to produce synthetic computer
generated voice, can pose a problem in terms of discerning real from fake audio. In the …

Knowledge-Based Linguistic Encoding for End-to-End Mandarin Text-to-Speech Synthesis

J Li, Z Wu, R Li, P Zhi, S Yang, H Meng - INTERSPEECH, 2019 - se.cuhk.edu.hk
Recent researches have shown superior performance of applying end-to-end architecture in
text-to-speech (TTS) synthesis. However, considering the complex linguistic structure of …

Assistive systems for visually impaired people: A survey on current requirements and advancements

P Kathiria, SH Mankad, J Patel, M Kapadia… - Neurocomputing, 2024 - Elsevier
In this survey, we provide a comprehensive study on the assistive technological devices
which help visually impaired persons in their day-to-day lives. With various forms of …

Emphasis detection for voice dialogue applications using multi-channel convolutional bidirectional long short-term memory network

L Zhang, J Jia, F Meng, S Zhou, W Chen… - … on Chinese Spoken …, 2018 - ieeexplore.ieee.org
Emphasis detection is important for user intention understanding in human-computer
interaction scenario. Techniques have been developed to detect the emphatic words in …

Statistical parametric speech synthesis using generalized distillation framework

ZC Liu, ZH Ling, LR Dai - IEEE Signal Processing Letters, 2018 - ieeexplore.ieee.org
This letter proposes an improved statistical parametric speech synthesis (SPSS) method
which utilizes auxiliary information for acoustic modeling under generalized distillation …

Concatenative text-to-speech synthesis system for communication recognition

RK Jaiswal, RK Dubey - 2021 5th International Conference on …, 2021 - ieeexplore.ieee.org
Text-to-speech (TTS) synthesis is one of the rapidly emerging areas of computer-to-human
interaction technology. Human-like speech is replicated by the computer with the …