Direct speech-to-speech translation with a sequence-to-sequence model

Y Jia, RJ Weiss, F Biadsy, W Macherey… - arxiv preprint arxiv …, 2019 - arxiv.org
We present an attention-based sequence-to-sequence neural network which can directly
translate speech from one language into speech in another language, without relying on an …

Libris2s: A german-english speech-to-speech translation corpus

P Jeuris, J Niehues - arxiv preprint arxiv:2204.10593, 2022 - arxiv.org
Recently, we have seen an increasing interest in the area of speech-to-text translation. This
has led to astonishing improvements in this area. In contrast, the activities in the area of …

Intent transfer in speech-to-speech machine translation

GK Anumanchipalli, LC Oliveira… - 2012 IEEE Spoken …, 2012 - ieeexplore.ieee.org
This paper presents an approach for transfer of speaker intent in speech-to-speech machine
translation (S2SMT). Specifically, we describe techniques to retain the prominence patterns …

On the use of context for predicting citation worthiness of sentences in scholarly articles

R Gosangi, R Arora, M Gheisarieha, D Mahata… - arxiv preprint arxiv …, 2021 - arxiv.org
In this paper, we study the importance of context in predicting the citation worthiness of
sentences in scholarly articles. We formulate this problem as a sequence labeling task …

[PDF][PDF] The SIWIS database: a multilingual speech database with acted emphasis

JP Goldman, PE Honnet, R Clark, PN Garner… - Interspeech …, 2016 - infoscience.epfl.ch
We describe here a collection of speech data of bilingual and trilingual speakers of English,
French, German and Italian. In the context of speech to speech translation (S2ST), this …

[PDF][PDF] Variational Attention Using Articulatory Priors for Generating Code Mixed Speech Using Monolingual Corpora.

SK Rallabandi, AW Black - INTERSPEECH, 2019 - cs.cmu.edu
Code Mixing-phenomenon where lexical items from one language are embedded in the
utterance of another-is relatively frequent in multilingual communities and therefore speech …

Preliminary work on speaker adaptation for DNN-based speech synthesis

B Potard, P Motlicek, D Imseng - 2015 - infoscience.epfl.ch
We investigate speaker adaptation in the context of deep neural network (DNN) based
speech synthesis. More specifically, our current work focuses on the exploitation of auxiliary …

[PDF][PDF] Verifying human users in speech-based interactions

S Shirali-Shahreza, Y Ganjali… - … Annual Conference of …, 2011 - dgp.toronto.edu
Verifying that a live human is interacting with an automated speech based system is needed
in some applications such as biometric authentication. In this paper, we present a method to …

Speaker-dependent model interpolation for statistical emotional speech synthesis

CY Hsu, CP Chen - EURASIP Journal on Audio, Speech, and Music …, 2012 - Springer
In this article, we propose a speaker-dependent model interpolation method for statistical
emotional speech synthesis. The basic idea is to combine the neutral model set of the target …

[PDF][PDF] Survey on speech, machine translation and gestures in ambient assisted living

D Anastasiou - Tralogy, Session, 2011 - researchgate.net
In this paper we provide the state-of-the-art of existing proprietary and free and open source
software (FOSS) automatic speech recognition (ASR), speech synthesizers, and Machine …