- Academic Search

H Purwins, B Li, T Virtanen, J Schlüter… - IEEE Journal of …, 2019 - ieeexplore.ieee.org

Given the recent surge in developments of deep learning, this paper provides a review of the
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …

Save Cite Cited by 921 Related articles All 7 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Sequence-to-sequence models can directly translate foreign speech

RJ Weiss, J Chorowski, N Jaitly, Y Wu… - arxiv preprint arxiv …, 2017 - arxiv.org

We present a recurrent encoder-decoder deep neural network architecture that directly
translates speech in one language into text in another. The model does not explicitly …

Save Cite Cited by 429 Related articles All 9 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Multilingual speech translation with efficient finetuning of pretrained models

X Li, C Wang, Y Tang, C Tran, Y Tang, J Pino… - arxiv preprint arxiv …, 2020 - arxiv.org

We present a simple yet effective approach to build multilingual speech-to-text (ST)
translation by efficient transfer learning from pretrained speech encoder and text decoder …

Save Cite Cited by 144 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

End-to-end speech-to-text translation: A survey

N Sethiya, CK Maurya - Computer Speech & Language, 2024 - Elsevier

Abstract Speech-to-Text (ST) translation pertains to the task of converting speech signals in
one language to text in another language. It finds its application in various domains, such as …

Save Cite Cited by 6 Related articles All 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Tied multitask learning for neural speech translation

A Anastasopoulos, D Chiang - arxiv preprint arxiv:1802.06655, 2018 - arxiv.org

We explore multitask models for neural translation of speech, augmenting them in order to
reflect two intuitive notions. First, we introduce a model where the second task decoder …

Save Cite Cited by 193 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Speech translation and the end-to-end promise: Taking stock of where we are

M Sperber, M Paulik - arxiv preprint arxiv:2004.06358, 2020 - arxiv.org

Over its three decade history, speech translation has experienced several shifts in its
primary research themes; moving from loosely coupled cascades of speech recognition and …

Save Cite Cited by 110 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] springer.com

Multimodal machine translation through visuals and speech

U Sulubacak, O Caglayan, SA Grönroos, A Rouhe… - Machine …, 2020 - Springer

Multimodal machine translation involves drawing information from more than one modality,
based on the assumption that the additional modalities will contain useful alternative views …

Save Cite Cited by 86 Related articles All 18 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Covost: A diverse multilingual speech-to-text translation corpus

C Wang, J Pino, A Wu, J Gu - arxiv preprint arxiv:2002.01320, 2020 - arxiv.org

Spoken language translation has recently witnessed a resurgence in popularity, thanks to
the development of end-to-end models and the creation of new corpora, such as Augmented …

Save Cite Cited by 89 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

A comparative study on end-to-end speech to text translation

P Bahar, T Bieschke, H Ney - 2019 IEEE Automatic Speech …, 2019 - ieeexplore.ieee.org

Recent advances in deep learning show that end-to-end speech to text translation model is
a promising approach to direct the speech translation field. In this work, we provide an …

Save Cite Cited by 85 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

A brief overview of unsupervised neural speech representation learning

L Borgholt, JD Havtorn, J Edin, L Maaløe… - arxiv preprint arxiv …, 2022 - arxiv.org

Unsupervised representation learning for speech processing has matured greatly in the last
few years. Work in computer vision and natural language processing has paved the way, but …

Save Cite Cited by 12 Related articles All 5 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Towards speech-to-text translation without speech recognition

Deep learning for audio signal processing

Sequence-to-sequence models can directly translate foreign speech

Multilingual speech translation with efficient finetuning of pretrained models

End-to-end speech-to-text translation: A survey

Tied multitask learning for neural speech translation

Speech translation and the end-to-end promise: Taking stock of where we are

Multimodal machine translation through visuals and speech

Covost: A diverse multilingual speech-to-text translation corpus

A comparative study on end-to-end speech to text translation

A brief overview of unsupervised neural speech representation learning