Multi-task learning of structured output layer bidirectional LSTMs for speech synthesis

A deep learning approaches in text-to-speech system: a systematic review and recent research perspective

Y Kumar, A Koul, C Singh - Multimedia Tools and Applications, 2023 - Springer

Text-to-speech systems (TTS) have come a long way in the last decade and are now a
popular research topic for creating various human-computer interaction systems. Although, a …

Cited by 82, 4 versions

[HTML] mdpi.com

[HTML] A review of deep learning based speech synthesis

Y Ning, S He, Z Wu, C …

… the model for speech synthesis of various aspects for
natural language processing. The speech synthesis explores by articulatory, formant and …

Cited by 70, 3 versions

On the generalizability of two-dimensional convolutional neural networks for fake speech detection

C Papastergiopoulos, A Vafeiadis… - Proceedings of the 1st …, 2022 - dl.acm.org

The powerful capabilities of modern text-to-speech methods to produce synthetic computer
generated voice, can pose a problem in terms of discerning real from fake audio. In the …

Cited by 8

[PDF] cuhk.edu.hk

[PDF] Knowledge-Based Linguistic Encoding for End-to-End Mandarin Text-to-Speech Synthesis.

J Li, Z Wu, R Li, P Zhi, S Yang, H Meng - INTERSPEECH, 2019 - se.cuhk.edu.hk

Recent researches have shown superior performance of applying end-to-end architecture in
text-to-speech (TTS) synthesis. However, considering the complex linguistic structure of …

Cited by 16, 3 versions

[PDF] amazonaws.com

Assistive systems for visually impaired people: A survey on current requirements and advancements

P Kathiria, SH Mankad, J Patel, M Kapadia… - Neurocomputing, 2024 - Elsevier

In this survey, we provide a comprehensive study on the assistive technological devices
which help visually impaired persons in their day-to-day lives. With various forms of …

3 versions

[PDF] tsinghua.edu.cn

Emphasis detection for voice dialogue applications using multi-channel convolutional bidirectional long short-term memory network

L Zhang, J Jia, F Meng, S Zhou, W Chen… - … on Chinese Spoken …, 2018 - ieeexplore.ieee.org

Emphasis detection is important for user intention understanding in human-computer
interaction scenario. Techniques have been developed to detect the emphatic words in …

Cited by 7, 3 versions

Statistical parametric speech synthesis using generalized distillation framework

ZC Liu, ZH Ling, LR Dai - IEEE Signal Processing Letters, 2018 - ieeexplore.ieee.org

This letter proposes an improved statistical parametric speech synthesis (SPSS) method
which utilizes auxiliary information for acoustic modeling under generalized distillation …

Cited by 8, 2 versions

Concatenative text-to-speech synthesis system for communication recognition

RK Jaiswal, RK Dubey - 2021 5th International Conference on …, 2021 - ieeexplore.ieee.org

Text-to-speech (TTS) synthesis is one of the rapidly emerging areas of computer-to-human
interaction technology. Human-like speech is replicated by the computer with the …

Cited by 3
