Speaker de-identification using diphone recognition and speech synthesis

T Justin, V Štruc, S Dobrišek, B Vesnicer… - 2015 11th IEEE …, 2015 - ieeexplore.ieee.org
The paper addresses the problem of speaker (or voice) de-identification by presenting a
novel approach for concealing the identity of speakers in their speech. The proposed …

KSU rich Arabic speech database

M Alsulaiman, G Muhammad, MA Bencherif… - Information …, 2013 - pure.ulster.ac.uk
Arabic is one of the major languages in the world. Unfortunately, not much research on
Arabic speaker recognition has been done. One main reason for this lack of research is the …

Speaker state recognition using an HMM-based feature extraction method

R Gajšek, F Mihelič, S Dobrišek - Computer Speech & Language, 2013 - Elsevier
In this article we present an efficient approach to modeling the acoustic features for the tasks
of recognizing various paralinguistic phenomena. Instead of the standard scheme of …

Strategies for managing time and costs in speech corpus creation: insights from the Slovenian ARTUR corpus

D Verdonik, A Bizjak, A Žgank, MS Maučec… - Language Resources …, 2024 - Springer
The paper details the creation of an open access speech corpus for a less-resourced
language, covering the diversity in terms of accents, dialects, speech styles and …

Multi-modal emotional database: AvID

R Gajšek, V Štruc, F Mihelič, A Podlesek, L Komidar… - Informatica, 2009 - informatica.si
Informatica 33 (2009) 101–106. Multi-Modal Emotional Database: AvID. Rok Gajšek, Vitomir …

Comparison of different classification methods for emotion recognition

T Justin, R Gajšek, V Štruc… - The 33rd International …, 2010 - ieeexplore.ieee.org
The paper presents a comparison of different classification techniques for the task of
classifying a speaker's emotional state into one of two classes: aroused and normal. The …

Emotion recognition using linear transformations in combination with video

R Gajšek, V Štruc, S Dobrišek, F Mihelič - Proc. Interspeech 2009, 2009 - isca-archive.org
The paper discusses the usage of linear transformations of Hidden Markov Models, normally
employed for speaker and environment adaptation, as a way of extracting the emotional …

Speech/non-speech segmentation based on phoneme recognition features

J Žibert, N Pavešić, F Mihelič - EURASIP Journal on Advances in Signal …, 2006 - Springer
This work assesses different approaches for speech and non-speech segmentation of audio
data and proposes a new, high-level representation of audio signals based on phoneme …

A voice-driven web browser for blind people.

B Vesnicer, J Zibert, S Dobrisek, N Pavesic… - …, 2003 - academia.edu
A small self-voicing Web browser designed for blind users is presented. The Web browser
was built from the GTK Web browser Dillo, which is a free software project in terms of the …

Building a rich Arabic speech database

MM Alsulaiman, G Muhammad… - 2011 Fifth Asia …, 2011 - ieeexplore.ieee.org
Availability of databases is a necessity in the speech processing field. The publicly
available databases in the Arabic language are few. In this paper we describe a rich database …