Google Tudós

T Baltrušaitis, C Ahuja… - IEEE transactions on …, 2018 - ieeexplore.ieee.org

Our experience of the world is multimodal-we see objects, hear sounds, feel texture, smell
odors, and taste flavors. Modality refers to the way in which something happens or is …

Mentés Hivatkozás Idézetek száma: 3890 Kapcsolódó cikkek Mind a(z) 11 változat

A deep learning approaches in text-to-speech system: a systematic review and recent research perspective

Y Kumar, A Koul, C Singh - Multimedia Tools and Applications, 2023 - Springer

Text-to-speech systems (TTS) have come a long way in the last decade and are now a
popular research topic for creating various human-computer interaction systems. Although, a …

Mentés Hivatkozás Idézetek száma: 82 Kapcsolódó cikkek Mind a(z) 4 változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arxiv preprint arxiv:2106.15561, 2021 - arxiv.org

Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Mentés Hivatkozás Idézetek száma: 469 Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] pubpub.org

[PDF][PDF] Jukebox: A generative model for music

P Dhariwal, H Jun, C Payne, JW Kim… - arxiv preprint arxiv …, 2020 - assets.pubpub.org

We introduce Jukebox, a model that generates music with singing in the raw audio domain.
We tackle the long context of raw audio using a multiscale VQ-VAE to compress it to discrete …

Mentés Hivatkozás Idézetek száma: 914 Kapcsolódó cikkek Mind a(z) 7 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Libritts: A corpus derived from librispeech for text-to-speech

H Zen, V Dang, R Clark, Y Zhang, RJ Weiss… - arxiv preprint arxiv …, 2019 - arxiv.org

This paper introduces a new speech corpus called" LibriTTS" designed for text-to-speech
use. It is derived from the original audio and text materials of the LibriSpeech corpus, which …

Mentés Hivatkozás Idézetek száma: 1061 Kapcsolódó cikkek Mind a(z) 6 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Neural speech synthesis with transformer network

N Li, S Liu, Y Liu, S Zhao, M Liu - … of the AAAI conference on artificial …, 2019 - ojs.aaai.org

Although end-to-end neural text-to-speech (TTS) methods (such as Tacotron2) are proposed
and achieve state-of-theart performance, they still suffer from two problems: 1) low efficiency …

Mentés Hivatkozás Idézetek száma: 924 Kapcsolódó cikkek Mind a(z) 10 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Natural tts synthesis by conditioning wavenet on mel spectrogram predictions

J Shen, R Pang, RJ Weiss, M Schuster… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

This paper describes Tacotron 2, a neural network architecture for speech synthesis directly
from text. The system is composed of a recurrent sequence-to-sequence feature prediction …

Mentés Hivatkozás Idézetek száma: 3434 Kapcsolódó cikkek Mind a(z) 7 változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Tacotron: Towards end-to-end speech synthesis

Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu… - arxiv preprint arxiv …, 2017 - arxiv.org

A text-to-speech synthesis system typically consists of multiple stages, such as a text
analysis frontend, an acoustic model and an audio synthesis module. Building these …

Mentés Hivatkozás Idézetek száma: 2308 Kapcsolódó cikkek Mind a(z) 7 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] canterbury.ac.nz

[KÖNYV][B] Human-robot interaction: An introduction

C Bartneck, T Belpaeme, F Eyssel, T Kanda, M Keijsers… - 2024 - books.google.com

The role of robots in society keeps expanding and diversifying, bringing with it a host of
issues surrounding the relationship between robots and humans. This introduction to human …

Mentés Hivatkozás Idézetek száma: 360 Kapcsolódó cikkek Mind a(z) 8 változat Könyvtári keresés

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

[PDF][PDF] Wavenet: A generative model for raw audio

A Van Den Oord, S Dieleman, H Zen… - arxiv preprint arxiv …, 2016 - academia.edu

This paper introduces WaveNet, a deep neural network for generating raw audio waveforms.
The model is fully probabilistic and autoregressive, with the predictive distribution for each …

Mentés Hivatkozás Idézetek száma: 6082 Kapcsolódó cikkek Mind a(z) 9 változat HTML-változat

Értesítés létrehozása

Hivatkozás

Speciális keresés

Mentve a Saját könyvtárba

Statistical parametric speech synthesis

Multimodal machine learning: A survey and taxonomy

A deep learning approaches in text-to-speech system: a systematic review and recent research perspective

A survey on neural speech synthesis

[PDF][PDF] Jukebox: A generative model for music

Libritts: A corpus derived from librispeech for text-to-speech

Neural speech synthesis with transformer network

Natural tts synthesis by conditioning wavenet on mel spectrogram predictions

Tacotron: Towards end-to-end speech synthesis

[KÖNYV][B] Human-robot interaction: An introduction

[PDF][PDF] Wavenet: A generative model for raw audio