Direct speech-to-speech translation with a sequence-to-sequence model
We present an attention-based sequence-to-sequence neural network which can directly
translate speech from one language into speech in another language, without relying on an …
translate speech from one language into speech in another language, without relying on an …
Libris2s: A german-english speech-to-speech translation corpus
P Jeuris, J Niehues - arxiv preprint arxiv:2204.10593, 2022 - arxiv.org
Recently, we have seen an increasing interest in the area of speech-to-text translation. This
has led to astonishing improvements in this area. In contrast, the activities in the area of …
has led to astonishing improvements in this area. In contrast, the activities in the area of …
Intent transfer in speech-to-speech machine translation
This paper presents an approach for transfer of speaker intent in speech-to-speech machine
translation (S2SMT). Specifically, we describe techniques to retain the prominence patterns …
translation (S2SMT). Specifically, we describe techniques to retain the prominence patterns …
On the use of context for predicting citation worthiness of sentences in scholarly articles
In this paper, we study the importance of context in predicting the citation worthiness of
sentences in scholarly articles. We formulate this problem as a sequence labeling task …
sentences in scholarly articles. We formulate this problem as a sequence labeling task …
[PDF][PDF] The SIWIS database: a multilingual speech database with acted emphasis
We describe here a collection of speech data of bilingual and trilingual speakers of English,
French, German and Italian. In the context of speech to speech translation (S2ST), this …
French, German and Italian. In the context of speech to speech translation (S2ST), this …
[PDF][PDF] Variational Attention Using Articulatory Priors for Generating Code Mixed Speech Using Monolingual Corpora.
Code Mixing-phenomenon where lexical items from one language are embedded in the
utterance of another-is relatively frequent in multilingual communities and therefore speech …
utterance of another-is relatively frequent in multilingual communities and therefore speech …
Preliminary work on speaker adaptation for DNN-based speech synthesis
We investigate speaker adaptation in the context of deep neural network (DNN) based
speech synthesis. More specifically, our current work focuses on the exploitation of auxiliary …
speech synthesis. More specifically, our current work focuses on the exploitation of auxiliary …
[PDF][PDF] Verifying human users in speech-based interactions
Verifying that a live human is interacting with an automated speech based system is needed
in some applications such as biometric authentication. In this paper, we present a method to …
in some applications such as biometric authentication. In this paper, we present a method to …
Speaker-dependent model interpolation for statistical emotional speech synthesis
CY Hsu, CP Chen - EURASIP Journal on Audio, Speech, and Music …, 2012 - Springer
In this article, we propose a speaker-dependent model interpolation method for statistical
emotional speech synthesis. The basic idea is to combine the neutral model set of the target …
emotional speech synthesis. The basic idea is to combine the neutral model set of the target …
[PDF][PDF] Survey on speech, machine translation and gestures in ambient assisted living
D Anastasiou - Tralogy, Session, 2011 - researchgate.net
In this paper we provide the state-of-the-art of existing proprietary and free and open source
software (FOSS) automatic speech recognition (ASR), speech synthesizers, and Machine …
software (FOSS) automatic speech recognition (ASR), speech synthesizers, and Machine …