- Academic Search

H Wang, H Wu, Z He, L Huang, KW Church - Engineering, 2022 - Elsevier

After more than 70 years of evolution, great achievements have been made in machine
translation. Especially in recent years, translation quality has been greatly improved with the …

Save Cite Cited by 233 Related articles All 2 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Viola: Conditional language models for speech recognition, synthesis, and translation

T Wang, L Zhou, Z Zhang, Y Wu, S Liu… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org

Recent research shows a big convergence in model architecture, training objectives, and
inference methods across various tasks for different modalities. In this paper, we propose …

Save Cite Cited by 102 Related articles All 2 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation

L Barrault, YA Chung, MC Meglioli, D Dale… - arxiv preprint arxiv …, 2023 - arxiv.org

What does it take to create the Babel Fish, a tool that can help individuals translate speech
between any two languages? While recent breakthroughs in text-based models have …

Save Cite Cited by 108 Related articles View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Direct speech-to-speech translation with discrete units

A Lee, PJ Chen, C Wang, J Gu, S Popuri, X Ma… - arxiv preprint arxiv …, 2021 - arxiv.org

We present a direct speech-to-speech translation (S2ST) model that translates speech from
one language to speech in another language without relying on intermediate text …

Save Cite Cited by 176 Related articles All 7 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Direct speech-to-speech translation with a sequence-to-sequence model

Y Jia, RJ Weiss, F Biadsy, W Macherey… - arxiv preprint arxiv …, 2019 - arxiv.org

We present an attention-based sequence-to-sequence neural network which can directly
translate speech from one language into speech in another language, without relying on an …

Save Cite Cited by 252 Related articles All 10 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Daspeech: Directed acyclic transformer for fast and high-quality speech-to-speech translation

Q Fang, Y Zhou, Y Feng - Advances in Neural Information …, 2023 - proceedings.neurips.cc

Direct speech-to-speech translation (S2ST) translates speech from one language into
another using a single model. However, due to the presence of linguistic and acoustic …

Save Cite Cited by 8 Related articles All 5 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Unity: Two-pass direct speech-to-speech translation with discrete units

H Inaguma, S Popuri, I Kulikov, PJ Chen… - arxiv preprint arxiv …, 2022 - arxiv.org

Direct speech-to-speech translation (S2ST), in which all components can be optimized
jointly, is advantageous over cascaded approaches to achieve fast inference with a …

Save Cite Cited by 47 Related articles All 6 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Enhanced direct speech-to-speech translation using self-supervised pre-training and data augmentation

S Popuri, PJ Chen, C Wang, J Pino, Y Adi, J Gu… - arxiv preprint arxiv …, 2022 - arxiv.org

Direct speech-to-speech translation (S2ST) models suffer from data scarcity issues as there
exists little parallel S2ST data, compared to the amount of data available for conventional …

Save Cite Cited by 68 Related articles All 6 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Transpeech: Speech-to-speech translation with bilateral perturbation

R Huang, J Liu, H Liu, Y Ren, L Zhang, J He… - arxiv preprint arxiv …, 2022 - arxiv.org

Direct speech-to-speech translation (S2ST) with discrete units leverages recent progress in
speech representation learning. Specifically, a sequence of discrete representations derived …

Save Cite Cited by 49 Related articles All 4 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Polyvoice: Language models for speech to speech translation

Q Dong, Z Huang, Q Tian, C Xu, T Ko, Y Zhao… - arxiv preprint arxiv …, 2023 - arxiv.org

We propose PolyVoice, a language model-based framework for speech-to-speech
translation (S2ST) system. Our framework consists of two language models: a translation …

Save Cite Cited by 24 Related articles All 2 versions Free GPT-4 DeepSeek View as HTML

Create alert

Cite

Advanced search

Saved to My library

The ATR multilingual speech-to-speech translation system

[HTML][HTML] Progress in machine translation

Viola: Conditional language models for speech recognition, synthesis, and translation

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation

Direct speech-to-speech translation with discrete units

Direct speech-to-speech translation with a sequence-to-sequence model

Daspeech: Directed acyclic transformer for fast and high-quality speech-to-speech translation

Unity: Two-pass direct speech-to-speech translation with discrete units

Enhanced direct speech-to-speech translation using self-supervised pre-training and data augmentation

Transpeech: Speech-to-speech translation with bilateral perturbation

Polyvoice: Language models for speech to speech translation