- Academic Search

Y Jia, RJ Weiss, F Biadsy, W Macherey… - arxiv preprint arxiv …, 2019 - arxiv.org

We present an attention-based sequence-to-sequence neural network which can directly
translate speech from one language into speech in another language, without relying on an …

Speichern Zitieren Zitiert von: 251 Ähnliche Artikel Alle 10 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Libris2s: A german-english speech-to-speech translation corpus

P Jeuris, J Niehues - arxiv preprint arxiv:2204.10593, 2022 - arxiv.org

Recently, we have seen an increasing interest in the area of speech-to-text translation. This
has led to astonishing improvements in this area. In contrast, the activities in the area of …

Speichern Zitieren Zitiert von: 8 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] cmu.edu

Intent transfer in speech-to-speech machine translation

GK Anumanchipalli, LC Oliveira… - 2012 IEEE Spoken …, 2012 - ieeexplore.ieee.org

This paper presents an approach for transfer of speaker intent in speech-to-speech machine
translation (S2SMT). Specifically, we describe techniques to retain the prominence patterns …

Speichern Zitieren Zitiert von: 34 Ähnliche Artikel Alle 13 Versionen

[Free GPT-4]

[PDF] arxiv.org

On the use of context for predicting citation worthiness of sentences in scholarly articles

R Gosangi, R Arora, M Gheisarieha, D Mahata… - arxiv preprint arxiv …, 2021 - arxiv.org

In this paper, we study the importance of context in predicting the citation worthiness of
sentences in scholarly articles. We formulate this problem as a sequence labeling task …

Speichern Zitieren Zitiert von: 11 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] epfl.ch

[PDF][PDF] The SIWIS database: a multilingual speech database with acted emphasis

JP Goldman, PE Honnet, R Clark, PN Garner… - Interspeech …, 2016 - infoscience.epfl.ch

We describe here a collection of speech data of bilingual and trilingual speakers of English,
French, German and Italian. In the context of speech to speech translation (S2ST), this …

Speichern Zitieren Zitiert von: 18 Ähnliche Artikel Alle 20 Versionen HTML-Version

[Free GPT-4]

[PDF] cmu.edu

[PDF][PDF] Variational Attention Using Articulatory Priors for Generating Code Mixed Speech Using Monolingual Corpora.

SK Rallabandi, AW Black - INTERSPEECH, 2019 - cs.cmu.edu

Code Mixing-phenomenon where lexical items from one language are embedded in the
utterance of another-is relatively frequent in multilingual communities and therefore speech …

Speichern Zitieren Zitiert von: 15 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] epfl.ch

Preliminary work on speaker adaptation for DNN-based speech synthesis

B Potard, P Motlicek, D Imseng - 2015 - infoscience.epfl.ch

We investigate speaker adaptation in the context of deep neural network (DNN) based
speech synthesis. More specifically, our current work focuses on the exploitation of auxiliary …

Speichern Zitieren Zitiert von: 19 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] toronto.edu

[PDF][PDF] Verifying human users in speech-based interactions

S Shirali-Shahreza, Y Ganjali… - … Annual Conference of …, 2011 - dgp.toronto.edu

Verifying that a live human is interacting with an automated speech based system is needed
in some applications such as biometric authentication. In this paper, we present a method to …

Speichern Zitieren Zitiert von: 15 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]

[PDF] springer.com

Speaker-dependent model interpolation for statistical emotional speech synthesis

CY Hsu, CP Chen - EURASIP Journal on Audio, Speech, and Music …, 2012 - Springer

In this article, we propose a speaker-dependent model interpolation method for statistical
emotional speech synthesis. The basic idea is to combine the neutral model set of the target …

Speichern Zitieren Zitiert von: 10 Ähnliche Artikel Alle 10 Versionen

[Free GPT-4]

[PDF] researchgate.net

[PDF][PDF] Survey on speech, machine translation and gestures in ambient assisted living

D Anastasiou - Tralogy, Session, 2011 - researchgate.net

In this paper we provide the state-of-the-art of existing proprietary and free and open source
software (FOSS) automatic speech recognition (ASR), speech synthesizers, and Machine …

Speichern Zitieren Zitiert von: 8 Ähnliche Artikel Alle 2 Versionen HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Personalising speech-to-speech translation in the EMIME project

Direct speech-to-speech translation with a sequence-to-sequence model

Libris2s: A german-english speech-to-speech translation corpus

Intent transfer in speech-to-speech machine translation

On the use of context for predicting citation worthiness of sentences in scholarly articles

[PDF][PDF] The SIWIS database: a multilingual speech database with acted emphasis

[PDF][PDF] Variational Attention Using Articulatory Priors for Generating Code Mixed Speech Using Monolingual Corpora.

Preliminary work on speaker adaptation for DNN-based speech synthesis

[PDF][PDF] Verifying human users in speech-based interactions

Speaker-dependent model interpolation for statistical emotional speech synthesis

[PDF][PDF] Survey on speech, machine translation and gestures in ambient assisted living