Google Tudós

Y Jia, RJ Weiss, F Biadsy, W Macherey… - arxiv preprint arxiv …, 2019 - arxiv.org

We present an attention-based sequence-to-sequence neural network which can directly
translate speech from one language into speech in another language, without relying on an …

Mentés Hivatkozás Idézetek száma: 251 Kapcsolódó cikkek Mind a(z) 10 változat HTML-változat

[Free GPT-4]

[PDF] arxiv.org

A generative model for raw audio using transformer architectures

P Verma, C Chafe - … Conference on Digital Audio Effects (DAFx …, 2021 - ieeexplore.ieee.org

This paper proposes a novel way of doing audio synthesis at the waveform level using
Transformer architectures. We propose a deep neural network for generating waveforms …

Mentés Hivatkozás Idézetek száma: 45 Kapcsolódó cikkek Mind a(z) 8 változat

[Free GPT-4]

[PDF] nature.com

Tibetan–Chinese speech-to-speech translation based on discrete units

Z Gong, X Xu, Y Zhao - Scientific Reports, 2025 - nature.com

Speech-to-speech translation (S2ST) has evolved from cascade systems which integrate
Automatic Speech Recognition (ASR), Machine Translation (MT), and Text-to-Speech (TTS) …

Mentés Hivatkozás Kapcsolódó cikkek Mind a(z) 4 változat

[Free GPT-4]

[PDF] arxiv.org

Neural Architectures Learning Fourier Transforms, Signal Processing and Much More....

P Verma - arxiv preprint arxiv:2308.10388, 2023 - arxiv.org

This report will explore and answer fundamental questions about taking Fourier Transforms
and tying it with recent advances in AI and neural architecture. One interpretation of the …

Mentés Hivatkozás Idézetek száma: 1 Kapcsolódó cikkek Mind a(z) 4 változat HTML-változat

Kazakh-Uzbek Speech Cascade Machine Translation on Complete Set of Endings

T Balabekova, B Kairatuly, U Tukeyev - International Conference on …, 2023 - Springer

Studies of speech-to-speech machine translation for Turkic languages are practically absent
due to the difficulties of creating parallel speech corpora for training neural models …

Mentés Hivatkozás Idézetek száma: 3 Kapcsolódó cikkek

Multi-Task Self-Supervised Learning Based Tibetan-Chinese Speech-to-Speech Translation

R Liu, Y Zhao, X Xu - 2023 International Conference on Asian …, 2023 - ieeexplore.ieee.org

Speech-to-speech translation tasks are commonly tackled by using a three-level cascade
system which comprises of speech recognition, machine translation, and speech synthesis …

Mentés Hivatkozás Idézetek száma: 2 Kapcsolódó cikkek

[Free GPT-4]

[PDF] arxiv.org

Learning to model aspects of hearing perception using neural loss functions

P Verma, J Berger - arxiv preprint arxiv:1912.05683, 2019 - arxiv.org

We present a framework to model the perceived quality of audio signals by combining
convolutional architectures, with ideas from classical signal processing, and describe an …

Mentés Hivatkozás Idézetek száma: 2 Kapcsolódó cikkek Mind a(z) 3 változat HTML-változat

Értesítés létrehozása

Hivatkozás

Speciális keresés

Mentve a Saját könyvtárba

End-to-end spoken language translation

Direct speech-to-speech translation with a sequence-to-sequence model

A generative model for raw audio using transformer architectures

Tibetan–Chinese speech-to-speech translation based on discrete units

Neural Architectures Learning Fourier Transforms, Signal Processing and Much More....

Kazakh-Uzbek Speech Cascade Machine Translation on Complete Set of Endings

Multi-Task Self-Supervised Learning Based Tibetan-Chinese Speech-to-Speech Translation

Learning to model aspects of hearing perception using neural loss functions