Google Scholar

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier

While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

Cited by 766 · All 13 versions

Conventional and contemporary approaches used in text to speech synthesis: A review

N Kaur, P Singh - Artificial Intelligence Review, 2023 - Springer

Nowadays speech synthesis or text to speech (TTS), an ability of system to produce human
like natural sounding voice from the written text, is gaining popularity in the field of speech …

Cited by 59 · All 3 versions

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org

Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Cited by 467 · All 2 versions

WaveNet: A generative model for raw audio

A Van Den Oord, S Dieleman, H Zen… - arXiv preprint arXiv …, 2016 - academia.edu

This paper introduces WaveNet, a deep neural network for generating raw audio waveforms.
The model is fully probabilistic and autoregressive, with the predictive distribution for each …

Cited by 5978 · All 10 versions

One-shot voice conversion by separating speaker and content representations with instance normalization

J Chou, C Yeh, H Lee - arXiv preprint arXiv:1904.05742, 2019 - arxiv.org

Recently, voice conversion (VC) without parallel data has been successfully adapted to
multi-target scenario in which a single model is trained to convert the input voice to many different …

Cited by 286 · All 11 versions

Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech

C Valentini-Botinhao, X Wang, S Takaki, J Yamagishi - SSW, 2016 - isca-archive.org

The quality of text-to-speech (TTS) voices built from noisy speech is compromised.
Enhancing the speech data before training has been shown to improve quality but voices …

Cited by 477 · All 7 versions

Statistical parametric speech synthesis using deep neural networks

H Zen, A Senior, M Schuster - 2013 IEEE International …, 2013 - ieeexplore.ieee.org

Conventional approaches to statistical parametric speech synthesis typically use decision
tree-clustered context-dependent hidden Markov models (HMMs) to represent probability …

Cited by 1181 · All 14 versions

Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis

H Zen, H Sak - … Conference on Acoustics, Speech and Signal …, 2015 - ieeexplore.ieee.org

Long short-term memory recurrent neural networks (LSTM-RNNs) have been applied to
various speech applications including acoustic modeling for statistical parametric speech …

Cited by 395 · All 12 versions

Statistical parametric speech synthesis

H Zen, K Tokuda, AW Black - Speech Communication, 2009 - Elsevier

This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …

Cited by 1665 · All 25 versions

Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory

T Toda, AW Black, K Tokuda - IEEE Transactions on Audio …, 2007 - ieeexplore.ieee.org

In this paper, we describe a novel spectral conversion method for voice conversion (VC). A
Gaussian mixture model (GMM) of the joint probability density of source and target features …

Cited by 1285 · All 15 versions

A speech parameter generation algorithm considering global variance for HMM-based speech synthesis