Google Scholar

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org

Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Cited by 469

AdaSpeech: Adaptive text to speech for custom voice

M Chen, X Tan, B Li, Y Liu, T Qin, S Zhao… - arXiv preprint arXiv …, 2021 - arxiv.org

Custom voice, a specific text to speech (TTS) service in commercial speech platforms, aims
to adapt a source TTS model to synthesize personal voice for a target speaker using few …

Cited by 231

Review of end-to-end speech synthesis technology based on deep learning

Z Mu, X Yang, Y Dong - arXiv preprint arXiv:2104.09995, 2021 - arxiv.org

As an indispensable part of modern human-computer interaction system, speech synthesis
technology helps users get the output of intelligent machine more easily and intuitively, thus …

Cited by 45

Mega-TTS 2: Boosting prompting mechanisms for zero-shot speech synthesis

Z Jiang, J Liu, Y Ren, J He, Z Ye, S Ji, Q Yang… - arXiv preprint arXiv …, 2023 - arxiv.org

Zero-shot text-to-speech (TTS) aims to synthesize voices with unseen speech prompts,
which significantly reduces the data and computation requirements for voice cloning by …

Cited by 35

VALL-E R: Robust and efficient zero-shot text-to-speech synthesis via monotonic alignment

B Han, L Zhou, S Liu, S Chen, L Meng, Y Qian… - arXiv preprint arXiv …, 2024 - arxiv.org

With the help of discrete neural audio codecs, large language models (LLM) have
increasingly been recognized as a promising methodology for zero-shot Text-to-Speech …

Cited by 17

USAT: A universal speaker-adaptive text-to-speech approach

W Wang, Y Song, S Jha - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org

Conventional text-to-speech (TTS) research has predominantly focused on enhancing the
quality of synthesized speech for speakers in the training dataset. The challenge of …

Cited by 11

AdaSpeech 2: Adaptive text to speech with untranscribed data

Y Yan, X Tan, B Li, T Qin, S Zhao… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Text to speech (TTS) is widely used to synthesize personal voice for a target speaker, where
a well-trained source TTS model is fine-tuned with few paired adaptation data (speech and …

Cited by 65

GANSpeech: Adversarial training for high-fidelity multi-speaker speech synthesis

J Yang, JS Bae, T Bak, Y Kim, HY Cho - arXiv preprint arXiv:2106.15153, 2021 - arxiv.org

Recent advances in neural multi-speaker text-to-speech (TTS) models have enabled the
generation of reasonably good speech quality with a single model and made it possible to …

Cited by 42

Speaker generation

D Stanton, M Shannon, S Mariooryad… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

This work explores the task of synthesizing speech in non-existent human-sounding voices.
We call this task "speaker generation", and present TacoSpawn, a system that performs …

Cited by 36

An Overview of Deep Neural Networks for Few-Shot Learning

J Zhao, L Kong, J Lv - Big Data Mining and Analytics, 2024 - ieeexplore.ieee.org

Recent advancements in deep learning have led to significant breakthroughs across various
fields. However, these methods often require extensive labeled data for optimal …

BOFFIN TTS: Few-shot speaker adaptation by Bayesian optimization

A survey on neural speech synthesis