UnitY: Two-pass direct speech-to-speech translation with discrete units
Direct speech-to-speech translation (S2ST), in which all components can be optimized
jointly, is advantageous over cascaded approaches to achieve fast inference with a …
End-to-end speech-to-text translation: A survey
N Sethiya, CK Maurya - Computer Speech & Language, 2024 - Elsevier
Abstract Speech-to-Text (ST) translation pertains to the task of converting speech signals in
one language to text in another language. It finds its application in various domains, such as …
Prompting the hidden talent of web-scale speech models for zero-shot task generalization
We investigate the emergent abilities of the recently proposed web-scale speech model
Whisper, by adapting it to unseen tasks with prompt engineering. We selected three tasks …
Translatotron 3: Speech to speech translation with monolingual data
This paper presents Translatotron 3, a novel approach to unsupervised direct speech-to-
speech translation from monolingual speech-text datasets by combining masked …
Joint pre-training with speech and bilingual text for direct speech to speech translation
Direct speech-to-speech translation (S2ST) is an attractive research topic with many
advantages compared to cascaded S2ST. However, direct S2ST suffers from the data …
Direct Speech-to-Speech Neural Machine Translation: A Survey
M Gupta, M Dutta, CK Maurya - arXiv preprint arXiv:2411.14453, 2024 - arxiv.org
Speech-to-Speech Translation (S2ST) models transform speech from one language to
another target language with the same linguistic information. S2ST is important for bridging …
Improving cascaded unsupervised speech translation with denoising back-translation
Most of the speech translation models heavily rely on parallel data, which is hard to collect
especially for low-resource languages. To tackle this issue, we propose to build a cascaded …
Kazakh-Uzbek speech cascade machine translation on complete set of endings
Studies of speech-to-speech machine translation for Turkic languages are practically absent
due to the difficulties of creating parallel speech corpora for training neural models …
SimulTron: On-Device Simultaneous Speech to Speech Translation
Simultaneous speech-to-speech translation (S2ST) holds the promise of breaking down
communication barriers and enabling fluid conversations across languages. However …
Transformer-Based End-to-End Speech Translation With Rotary Position Embedding
Recently, many Transformer-based models have been applied to end-to-end speech
translation because of their capability to model global dependencies. Position embedding is …