Speaker de-identification using diphone recognition and speech synthesis
The paper addresses the problem of speaker (or voice) de-identification by presenting a
novel approach for concealing the identity of speakers in their speech. The proposed …
KSU rich Arabic speech database
Arabic is one of the major languages in the world. Unfortunately, little research has been
done on Arabic speaker recognition. One main reason for this lack of research is the …
Speaker state recognition using an HMM-based feature extraction method
In this article we present an efficient approach to modeling the acoustic features for the tasks
of recognizing various paralinguistic phenomena. Instead of the standard scheme of …
Strategies for managing time and costs in speech corpus creation: insights from the Slovenian ARTUR corpus
The paper details the creation of an open access speech corpus for a less-resourced
language, covering the diversity in terms of accents, dialects, speech styles and …
Multi-modal emotional database: AvID
Informatica 33 (2009) 101–106. Rok Gajšek, Vitomir …
Comparison of different classification methods for emotion recognition
The paper presents a comparison of different classification techniques for the task of
classifying a speaker's emotional state into one of two classes: aroused and normal. The …
Emotion recognition using linear transformations in combination with video
The paper discusses the use of linear transformations of Hidden Markov Models, normally
employed for speaker and environment adaptation, as a way of extracting the emotional …
Speech/non-speech segmentation based on phoneme recognition features
This work assesses different approaches for speech and non-speech segmentation of audio
data and proposes a new, high-level representation of audio signals based on phoneme …
A voice-driven web browser for blind people
A small self-voicing Web browser designed for blind users is presented. The Web browser
was built from the GTK Web browser Dillo, which is a free software project in terms of the …
Building a rich Arabic speech database
Availability of databases is a necessity in the speech processing field. Few publicly
available databases exist for the Arabic language. In this paper we describe a rich database …