Google Scholar

Direct speech-to-speech translation with discrete units

A Lee, PJ Chen, C Wang, J Gu, S Popuri, X Ma… - arXiv preprint arXiv …, 2021 - arxiv.org

We present a direct speech-to-speech translation (S2ST) model that translates speech from
one language to speech in another language without relying on intermediate text …

Cited by 177

Seamless: Multilingual Expressive and Streaming Speech Translation

L Barrault, YA Chung, MC Meglioli, D Dale… - arXiv preprint arXiv …, 2023 - arxiv.org

Large-scale automatic speech translation systems today lack key features that help machine-
mediated communication feel seamless when compared to human-to-human dialogue. In …

Cited by 105

Direct speech-to-speech translation with a sequence-to-sequence model

Y Jia, RJ Weiss, F Biadsy, W Macherey… - arXiv preprint arXiv …, 2019 - arxiv.org

We present an attention-based sequence-to-sequence neural network which can directly
translate speech from one language into speech in another language, without relying on an …

Enhanced speech-to-speech translation system and methods for adding a new word

A Waibel, IR Lane - US Patent 8,972,268, 2015 - Google Patents

A speech translation system and methods for cross-lingual communication
that enable users to improve and modify content and usage of the system and easily abort …

Cited by 454

Speech translation and the end-to-end promise: Taking stock of where we are

M Sperber, M Paulik - arXiv preprint arXiv:2004.06358, 2020 - arxiv.org

Over its three decade history, speech translation has experienced several shifts in its
primary research themes; moving from loosely coupled cascades of speech recognition and …

Cited by 111

A holistic cascade system, benchmark, and human evaluation protocol for expressive speech-to-speech translation

WC Huang, B Peloquin, J Kao, C Wang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Expressive speech-to-speech translation (S2ST) aims to transfer prosodic attributes of
source speech to target speech while maintaining translation accuracy. Existing research in …

Cited by 19

End-to-end speech translation with transcoding by multi-task learning for distant language pairs

T Kano, S Sakti, S Nakamura - IEEE/ACM Transactions on …, 2020 - ieeexplore.ieee.org

Directly translating spoken utterances from a source language to a target language is
challenging because it requires a fundamental transformation in both linguistic and para/non …

Cited by 43

Structured-based curriculum learning for end-to-end English-Japanese speech translation

T Kano, S Sakti, S Nakamura - arXiv preprint arXiv:1802.06003, 2018 - arxiv.org

Sequence-to-sequence attentional-based neural network architectures have been shown to
provide a powerful model for machine translation and speech recognition. Recently, several …

Cited by 61

Enhancing speech-to-speech translation with multiple TTS targets

J Shi, Y Tang, A Lee, H Inaguma… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

It has been known that direct speech-to-speech translation (S2ST) models usually suffer
from the data scarcity issue because of the limited existing parallel materials for both source …

Cited by 7

Controlling prosody in end-to-end TTS: A case study on contrastive focus generation

S Latif, I Kim, I Calapodescu… - Proceedings of the 25th …, 2021 - aclanthology.org

While End-2-End Text-to-Speech (TTS) has made significant progresses over the
past few years, these systems still lack intuitive user controls over prosody. For instance …

Cited by 13
